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This  dissertation  considers  Bayesian  robustness  in  the  context  of  the  finite  pop- 
ulation sampling.  Although  we  are  concerned  exclusively  with  the  finite  population 
mean,  the  general  methods  are  applicable  to  other  parameters  of  interest  as  well. 

First,  we  consider  some  robust  Bayes  competitors  of  the  sample  mean  and  the 
subjective  Bayes  estimators  of  the  finite  population  mean.  Specifically,  we  have  pro- 
posed some  robust  Bayes  estimators  using  ML-II  priors  from  the  ^-contamination 
class.  Classes  of  contaminations  that  are  considered  include  all  possible  contamina- 
tions and  all  symmetric,  unimodal  contaminations.  These  estimators  are  compared 
with  the  sample  mean  and  subjective  Bayes  estimators  in  terms  of  “posterior  robust- 
ness” and  “procedure  robustness”.  Similar  robust  Bayes  estimators  are  introduced 
in  the  presence  of  auxiliary  information,  and  these  are  compared  again  in  terms 
of  “posterior  robustness”  and  “procedure  robustness”  with  the  ratio  estimator,  and 
subjective  Bayes  estimators  utilizing  the  auxiliary  information.  Also,  we  provide  the 
range  where  the  posterior  mean  belongs  under  e-contamination  models. 
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Second,  we  consider  the  idea  of  developing  prior  distributions  which  provide  Bayes 
rules  that  are  inherently  robust  with  respect  to  reasonable  misspecification  of  the 
prior.  We  provide  the  robust  Bayes  estimators  of  the  finite  population  mean  based 
on  heavy-tailed  prior  distributions  using  scale  mixtures  of  normals.  A Monte  Carlo 
method,  the  Gibbs  sampler,  has  been  used  for  implementation  of  the  Bayesian  pro- 
gram. Also,  the  asymptotic  optimality  properties  of  proposed  robust  Bayes  estimators 
are  proved. 

Finally,  we  address  Bayesian  robustness  in  the  hierarchical  Bayes  setting  for  small 
area  estimation.  We  provide  robust  hierarchical  Bayes  estimators  of  the  small  area 
means  based  on  ML-II  priors  under  the  e-contamination  class  where  the  contamina- 
tion class  includes  all  unimodal  distributions.  Also,  we  provide  the  range  where  the 
small  area  means  belong  under  e-contamination  class  as  well  as  the  density  ratio  class 
of  priors.  For  the  class  of  priors  that  are  uniform  over  a specified  interval,  we  inves- 
tigate the  sensitivity  as  to  the  choice  of  the  intervals.  The  methods  are  illustrated 
using  data  related  to  the  median  income  of  four-person  families  in  the  50  states  and 
District  of  Columbia. 
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CHAPTER  1 


INTRODUCTION 
1.1  Literature  Review 


When  prior  information  about  an  unknown  parameter,  say  9 , is  available,  Bayesian 
analysis  requires  quantification  of  this  information  in  the  form  of  a (prior)  distribu- 
tion on  the  parameter  space  for  any  statistical  analysis.  The  most  frequent  criticism 
of  subjective  Bayesian  analysis  is  that  it  supposedly  assumes  an  ability  to  quantify 
available  prior  information  completely  and  accurately  in  terms  of  a single  prior  dis- 
tribution. Given  the  common  and  unavoidable  practical  limitations  on  factors  such 
as  available  prior  elicitation  techniques  and  time,  it  is  rather  unrealistic  to  be  able 
to  quantify  prior  information  in  terms  of  one  distribution  with  complete  accuracy. 
In  view  of  this  difficulty  in  prior  elicitation,  there  has  long  been  a robust  Bayesian 
viewpoint  that  assumes  only  that  subjective  information  can  be  quantified  only  in 
terms  of  a class  T of  prior  distributions.  A procedure  is  then  said  to  be  robust,  if  its 
inferences  are  relatively  insensitive  to  the  variation  of  the  prior  distribution  over  T. 

The  robust  Bayesian  idea  can  be  traced  back  to  Good  as  early  as  in  1950  (see 
for  example  Good  (1965)),  and  has  been  popularized  in  recent  years,  notably  by  the 
stimulating  article  of  Berger  (1984).  There  is  a rapidly  growing  literature  in  the 
area  of  robust  Bayesian  analysis.  Berger  (1984,  1985,  1990)  and  Wasserman  (1992) 
provide  reviews  and  discussion  of  the  various  issues  and  approaches.  Berger  (1984) 
discusses  the  philosophical  or  pragmatic  reasons  for  adopting  the  robust  Bayesian 
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viewpoint  along  with  a review  of  some  of  the  leading  approaches  to  robust  Bayesian 
analysis.  More  recently,  Berger  (1990)  provides  a review  of  different  approaches  to  the 
selection  of  T and  techniques  used  in  the  analyses.  Wasserman  (1992)  discusses  several 
approaches  to  computation  of  bounds  on  posterior  expectations  for  certain  classes  of 
priors  as  well  as  different  approaches  to  graphical  and  numerical  summarization. 

Various  classes  of  priors  have  been  proposed  and  studied  in  the  literature.  One 
such  class  is  the  e-contamination  class  having  the  form 

r = {vr  : n(0)  = (1  - e)7r0 (0)  + eq(0),  q 6 Q}  (1.1.1) 

where  Q is  the  class  of  allowable  contaminations,  and  no  is  a fixed  prior,  commonly 
called  the  base  prior,  which  can  be  thought  of  as  the  prior  one  would  use  if  a single 
prior  had  to  be  chosen,  e-contamination  priors  have  been  used  by  Huber  (1973), 
Berger  (1984,  1990),  Berger  and  Berliner  (1986),  and  Sivaganesan  and  Berger  (1989) 
among  others.  Berger  and  Berliner  (1986)  studied  the  issue  of  selecting,  in  a data  de- 
pendent fashion,  a “good”  prior  distribution  (the  type-II  maximum  likelihood  prior) 
from  the  e-contamination  class,  and  using  this  prior  in  any  subsequent  analysis.  Siva- 
ganesan and  Berger  (1989)  determined  the  ranges  of  the  various  posterior  quantities 
of  interest  (e.g.  the  posterior  mean,  posterior  variance,  posterior  probability  of  a 
set  (allowing  for  credible  sets  or  tests))  as  the  prior  varies  over  T.  Wasserman  (1989) 
provides  a robust  interpretation  of  likelihood  regions  in  the  case  of  all  possible  contam- 
inations. Also  related  is  Berger  and  Sellke  (1987),  who  carry  out  the  determination 
of  the  range  of  the  posterior  probability  of  a hypothesis  when  e = 1 in  (1.1.1)  (i.e., 
when  there  is  no  specified  subjective  prior  no). 

Another  class  of  interest  has  been  considered  in  DeRobertis  and  Hartigan  (1981) 
to  find  ranges  of  general  posterior  quantities.  This  class  is  of  the  form 


r . Mfr)  < 9\{0i) 
' 7t(02)  ~ 02(02) 


V0i,02} 


(1.1.2) 
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where  g 2 < g\  are  given  positive  functions.  Such  class  is  called  a density  ratio  class 
by  Berger  (1990).  Wasserman  and  Kadane  (1992)  develop  a Monte  Carlo  approach 
to  computing  bounds  on  posterior  expectations  over  the  density  ratio  class.  Cano 
(1993)  considers  the  range  of  the  posterior  mean  in  hierarchical  Bayes  models  for 
several  classes  of  priors  including  e-contamination  class  and  density  ratio  class. 

Learner  (1982)  and  Polasek  (1985)  considers  a class  of  conjugate  priors  with  con- 
straints on  the  domain  of  parameters  to  get  closed  form  expressions  for  posterior 
expectations,  and  did  sensitivity  analyses.  DasGupta  and  Studden  (1988)  consider 
the  above  conjugate  class,  as  well  as  a density  ratio  class  in  the  context  of  Bayesian 
design  of  experiments. 

The  difficulty  of  working  with  a class  of  priors  makes  it  very  appealing  to  find 
prior  distributions  which  provide  Bayes  rules  that  are  naturally  robust  with  respect 
to  reasonable  misspecification  of  the  prior.  Since  Box  and  Tiao  (1968,  1973),  Dawid 
(1973),  and  O’Hagan  (1979),  it  has  been  recognized  that  insensitivity  to  outliers  in 
Bayesian  analysis  can  be  achieved  through  the  use  of  flat-tailed  distributions.  Use  of 
t-priors  when  the  data  are  normal  has  been  specifically  considered  by  West  (1985)  and 
O’Hagan  (1989).  Angers  and  Berger  (1991)  consider  a robust  Bayesian  estimation 
of  exchangeable  means  using  Cauchy  priors.  In  Angers  (1992),  the  Cauchy  prior  is 
replaced  by  t-prior  with  odd  number  of  degrees  of  freedom.  Andrews  and  Mallows 
(1974)  and  West  (1987)  study  scale  mixture  of  normal  distributions  which  can  be 
used  for  achieving  Bayesian  robustness.  The  Student  t family,  double-exponential, 
logistic,  and  the  exponential  power  family  can  all  be  constructed  as  scale  mixtures  of 
normals  (cf  Andrews  and  Mallows  (1974)).  West  (1984)  uses  scale  mixtures  of  normal 
distributions  for  both  the  random  errors  and  the  priors  in  Bayesian  linear  modelling. 
Carlin  and  Poison  (1991)  discuss  Bayesian  model  specification  and  provide  a general 
paradigm  for  the  Bayesian  modelling  of  nonnormal  errors  by  using  scale  mixtures  of 
normal  distributions. 
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The  references  so  far  pertain  to  Bayesian  analysis  for  infinite  populations.  Bayesian 
analysis  in  finite  population  sampling  is  of  more  recent  vintage.  A unified  and  elegant 
formulation  of  Bayes  estimation  in  finite  population  sampling  was  given  by  Hill  (1968) 
and  Ericson  (1969).  Since  then,  there  are  many  papers  in  the  area  of  Bayes  estimation 
in  finite  population  sampling.  But  most  of  the  Bayesian  literature  in  survey  sampling 
deals  with  subjective  Bayesian  analysis  in  that  the  inference  procedure  is  based  on  a 
single  completely  specified  prior  distribution. 

The  need  for  robust  Bayesian  analysis  in  survey  sampling,  however,  has  been  felt 
by  some  authors,  although  the  methods  described  in  the  first  few  paragraphs  of  this 
section  have  not  so  far  been  used  to  achieve  this  end.  Godambe  and  Thompson 
(1971)  adapted  a framework  whereby  the  prior  information  could  only  be  quantified 
up  to  a class  T (C  in  their  notation)  of  prior  distributions.  For  estimating  the  pop- 
ulation total  in  the  presence  of  auxiliary  information,  they  came  up  with  the  usual 
ratio  and  difference  estimators,  justifying  these  on  the  ground  of  location  invariance. 
The  model  assumption  there  played  a very  minimal  role,  the  main  idea  being  that 
model-based  inference  statements  could  be  replaced,  in  the  case  of  model-failure  by 
design-based  inference.  In  a later  study,  Godambe  (1982)  considered  the  more  com- 
mon phenomenon  of  specific  departures  from  the  assumed  model.  His  contention 
there  was  that  sampling  designs  could  be  a useful  companion  of  model-based  infer- 
ence procedures  to  generate  “near-optimal”  and  “robust”  estimators.  However,  the 
basic  model  assumed  in  that  paper  considered  2/i,  * * * , 2/jv  to  be  independent,  and 
attention  was  confined  only  to  design  and  model  unbiased  estimators.  Royall  and 
Pfefferman  (1982)  also  consider  robustness  of  certain  Bayes  estimators  in  finite  popu- 
lations. However,  their  main  concern  is  to  find  out  conditions  under  which  the  Bayes 
estimators  under  an  assumed  model  remain  the  same  under  certain  departures  from 
the  model. 
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Much  of  the  earlier  work  as  well  as  some  of  the  current  work  in  finite  population 
sampling  relates  to  inference  for  a single  stratum.  However,  there  is  a recent  need 
for  simultaneous  inference  from  several  strata,  particularly  in  the  context  of  small 
area  estimation  which  is  becoming  increasingly  popular  in  survey  sampling.  Agencies 
of  the  Federal  Government  have  been  involved  in  obtaining  estimates  of  population 
counts,  unemployment  rates,  per  capita  income,  crop  yields  and  so  forth  for  state  and 
local  government  areas.  In  typical  instances,  only  a few  samples  are  available  from 
an  individual  area,  and  an  estimate  of  a certain  area  mean  or  simultaneous  estimates 
of  several  area  means  can  be  improved  by  incorporating  information  from  similar 
neighboring  areas.  Ghosh  and  Rao  (1994)  have  recently  surveyed  an  early  history  as 
well  as  the  recent  developments  on  small  area  estimation. 

Bayesian  methodology  has  been  implemented  in  improving  small  area  estimators. 
Empirical  Bayes  (EB)  approach  has  been  considered  for  simultaneous  estimation  of 
the  parameters  for  several  small  areas  where  each  area  contains  a finite  number  of 
elements,  by  Fay  and  Harriot  (1979),  Ghosh  and  Meeden  (1986),  and  Ghosh  and 
Lahiri  (1987).  Datta  and  Ghosh  (1991),  as  an  alternative  to  the  EB  procedure, 
propose  hierarchical  Bayes  (HB)  approach  for  estimation  of  small  area  means  under 
general  mixed  linear  models,  and  also  discuss  the  computational  aspects.  To  handle 
the  case  of  heavy  tailed  priors  in  the  setting  of  Fay  and  Herriot,  Datta  and  Lahiri 
(1994)  use  t-priors  viewing  them  as  scale  mixture  of  normals. 

1.2  The  Subject  of  This  Dissertation 

This  dissertation  considers  Bayesin  robustness  in  the  context  of  the  finite  popu- 
lation sampling.  Although  we  are  concerned  exclusively  with  the  finite  population 
mean,  the  general  methods  are  applicable  to  other  parameters  of  interest  as  well. 

In  Chapter  2,  we  develop  some  robust  Bayes  estimators  of  the  finite  population 
mean  using  ML-II  priors  under  ^-contamination  models.  We  compare  the  performance 
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of  these  estimators  with  the  subjective  Bayes  estimators  as  well  as  the  sample  means 
using  the  criteria  of  “posterior  robustness”  and  “procedure  robustness”  as  suggested 
by  Berger  (1984).  As  a consequence  of  the  results  on  procedure  robustness,  we  have 
established  some  asymtotic  optimality  of  the  robust  Bayes  estimators  even  in  the 
presence  of  a subjective  prior.  Furthermore,  modifying  slightly  the  arguments  of 
Siverganesan  and  Berger  (1989),  we  have  provided  the  range  where  the  posterior 
mean  belongs  under  e-contamination  models. 

In  Chapter  3,  we  provide  expressions  for  variations  of  the  posterior  mean  within 
the  e-contamination  class  of  priors  in  the  presence  of  auxiliary  information  following 
Siverganesan  and  Berger  (1989).  Moreover,  similar  robust  Bayes  predictors  using  the 
ML-II  priors  under  this  contamination  class  of  priors  are  introduced  in  the  presence 
of  auxiliary  information.  We  compare  these  estimators  again  in  terms  of  “posterior 
robustness”  and  “procedure  robustness”  with  the  classical  ratio  estimators,  and  a 
subjective  Bayes  predictors  utilizing  the  auxiliary  information.  In  the  course  of  cal- 
culating indices  for  procedure  robustness,  we  have  proved  the  asymptotic  optimality 
of  the  robust  Bayes  predictors. 

In  Chapter  4,  we  propose  certain  robust  Bayes  estimators  of  the  finite  population 
mean  from  a different  perspective.  In  order  to  overcome  the  problem  associated  with 
outliers  in  the  context  of  finite  population  sampling,  we  develop  Bayes  estimators 
of  the  finite  population  mean  based  on  heavy-tailed  priors  using  scale  mixtures  of 
normals.  Also,  we  study  the  asymptotic  optimality  property  of  the  proposed  Bayes 
estimators.  For  implementation  we  have  devised  a computer-intensive  fully  Bayesian 
procedure  which  uses  Markov  chain  Monte  Carlo  integration  techniques  like  the  Gibbs 
sampler. 

Chapter  5 addresses  Bayesian  robustness  in  the  hierarchical  Bayes  setting  in  the 
context  of  small  area  estimation.  We  provide  the  robust  hierarchical  Bayes  estimators 
of  the  small  area  means  based  on  ML-II  priors  under  the  e-contamination  class  of 
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priors  where  the  contamination  class  includes  all  unimodal  distributions.  We  compare 
these  estimators  with  HB  and  EB  estimators.  Also,  we  provide  the  range  where  the 
small  area  means  belong  under  e-contamination  class  as  well  as  the  density  ratio  class 
of  priors. 

Finally,  in  Chapter  6,  we  discuss  some  open  problems  which  will  be  topics  for 
future  research. 


CHAPTER  2 


ROBUST  BAYES  ESTIMATION  OF  THE  FINITE  POPULATION  MEAN 


2.1  Introduction 


Consider  a finite  population  U with  units  labeled  1,2,  ...,7V.  Let  y,  denote  the 
value  of  a single  characteristic  attached  to  the  unit  i.  The  vector  y = (2/1 , * * * , Vn)t 
is  the  unknown  state  of  nature,  and  is  assumed  to  belong  to  O = RN . A subset  s of 
{1,2,...,  N}  is  called  a sample.  Let  n(s)  denote  the  number  of  elements  belonging  to 
s.  The  set  of  all  possible  samples  is  denoted  by  S.  A design  is  a function  p on  S such 
that  p(s)  € [0, 1]  for  all  s € S and  Y,sesP(s)  = L Given  y € 0 and  s = {A,  • • • , in(s)} 
with  1 < i\  < • • • < in{s)  < AT,  let  y(s)  = { y fl,  • • • , yin  }.  One  of  the  main  objectives 
in  sample  surveys  is  to  draw  inference  about  y or  some  function  (real  or  vector  valued) 
7 (y)  of  y on  the  basis  of  s and  y{s). 

Bayes  estimators  of  the  finite  population  total  (or  the  mean)  within  the  class  of  lin- 
ear unbiased  estimators  were  obtained  by  Godambe  (1955).  Subsequently  Godambe 
and  Joshi  (1965)  found  such  estimators  within  the  class  of  all  unbiased  estimators. 
General  Bayesian  approach  for  finite  population  sampling  was  initiated  by  Hill  (1968) 
and  Ericson  (1969).  Since  then,  a huge  literature  has  grown  in  this  area.  Ericson 
(1983)  provides  a recent  account. 

We  have  discussed  the  use  of  a single  subjective  prior  in  Section  1.1.  In  this  chap- 
ter, we  generate  certain  robust  Bayes  estimators  using  e-contamination  priors,  and 
study  their  performance  over  a broad  class  of  prior  distributions  on  the  parameter 
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space.  As  we  mentioned  in  Section  1.2,  we  have  restricted  ourselves  to  the  estimation 
of  the  finite  population  mean.  It  may  be  noted  that  in  our  framework,  the  sam- 
pling design  does  not  play  any  role  in  the  robustness  study.  More  generally,  in  the 
Bayesian  framework,  the  irrelevance  of  the  sampling  design  at  the  inference  stage  has 
been  pointed  out  earlier  by  Godambe  (1966),  Basu  (1969)  and  Ericson  (1969).  Our 
findings  are  consistent  with  that.  This,  however,  does  not  undermine  the  importance 
of  sampling  design  in  the  actual  choice  of  units  in  a survey.  It  is  very  crucial  to  design 
a survey  before  its  actual  execution,  and  we  have  made  some  comments  on  why  a 
simple  random  sample  is  justified  even  within  our  framework. 

Royall  and  Pfefferman  (1982)  also  consider  robustness  of  certain  Bayes  estima- 
tors. However,  their  main  concern  is  to  find  out  conditions  under  which  the  Bayes 
estimators  under  an  assumed  model  remain  the  same  under  certain  departures  from 
the  model. 

The  contents  of  the  remaining  sections  are  as  follows.  In  Section  2.2,  we  develop 
some  robust  Bayes  estimators  of  the  finite  population  mean  using  ML-II  priors  under 
e-contamination  models  (see  Good  (1965)  and  Berger  and  Berliner  (1986))  where 
the  contamination  class  includes  all  possible  distributions.  As  in  Berger  (1984),  the 
concepts  of  posterior  and  procedure  robustness  are  introduced,  and  the  proposed 
robust  Bayes  estimators  are  compared  to  the  sample  means  and  the  subjective  Bayes 
estimators  under  this  criterion.  It  turns  out  that  the  robust  Bayes  estimator  enjoys 
good  posterior  as  well  as  procedure  robustness  property  relative  to  the  sample  mean, 
while  the  subjective  Bayes  estimator  enjoys  good  posterior  robustness,  but  lacks  in 
procedure  robustness.  As  a consequence  of  the  results  on  procedure  robustness,  we 
have  established  some  asymtotic  (as  the  sample  size  increases)  optimality  of  the  robust 
Bayes  estimators  in  the  presence  of  a subjective  prior.  Specifically  we  have  shown 
that  the  difference  of  the  Bayes  risks  of  the  proposed  robust  Bayes  estimator  and  the 
optimal  Bayes  estimator  converges  to  zero  at  a certain  rate  as  the  sample  size  tends 
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to  infinity.  Also,  modifying  slightly  the  arguments  of  Siverganesan  (1988),  we  have 
provided  the  range  where  the  posterior  mean  belongs  under  e-contamination  models. 
This  pertains  to  the  sensitivity  analysis  of  the  proposed  robust  Bayes  procedure. 

Section  2.3  contains  the  more  realistic  framework  when  the  contamination  class 
contains  only  symmetric  unimodal  distributions.  We  have  developed  robust  Bayes 
estimators  as  competitors  of  the  classical  estimator  as  well  as  the  subjective  Bayes 
estimator,  and  have  studied  their  properties. 

Finally,  in  Section  2.4,  a numerical  example  is  provided  to  illustrate  the  methods 
described  in  Sections  2.2  and  2.3. 

One  of  the  highlights  of  this  chapter  is  the  analytical  study  of  the  procedure 
robustness  and  the  subsequent  asymptotic  optimality  in  the  presence  of  a subjective 
prior.  To  our  knowledge,  this  issue  has  not  been  addressed  before  in  the  previous 
work. 

For  simplicity,  in  the  subsequent  sections,  only  the  case  where  p(s)  > 0 if  and  only 
if  n(s)  = n will  be  considered.  This  amounts  to  considering  only  fixed  samples  of  size 
n.  Also,  throughout  this  chapter,  the  loss  is  assumed  to  be  squared  error. 

2.2  Robust  Baves  Estimators 

Consider  the  case  when  \9  ~ N(9,a2)  (i=l,  ...,  N)  and  9 ~ N(ii0,Tq).  Write 
M0  = cr2/ro,  B0  = M0/(M0  + n),  y(s)  = n^EsesVi  and  y(s)  = {y{  : i s}, 
the  suffixes  in  y(s)  being  arranged  in  the  ascending  order.  From  Ericson  (1969),  it 
follows  that  the  posterior  distribution  of  y(s)  given  s and  y(s)  is  iV({(  1 — B0)y(s ) + 
B0/io}lAr-n,<72(I/v_n  + (Mo-l-n)_1Jjv_n)),  where  lu  is  a u-component  column  vector 
with  each  element  equal  to  1,  Ju  = lul^  and  Iu  is  the  identity  matrix.  Then  the 
Bayes  estimator  of  7 (y)  = iV_1  Vi  is 


S°{s,y{s))  = El[7(2/)|s,y(s)]  = fy{s)  + (1  -/){(!-  B0)y(s)  + B0y0} 
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= y{s)  - (1  - f)B0(y(s)  - /i0),  (2.2.1) 

where  / = n/N.  Strictly  speaking,  we  should  call  5°  a predictor  of  7 (y)  since  5°  is 
the  sum  of  the  observed  yi  s plus  a predictor  of  the  total  of  the  unobserved  y^s,  but 
we  shall  use  the  two  terms  “estimators”  and  “predictors”  interchangeably.  Also,  the 
associated  posterior  variance  is  given  by 

^(7(y)k  y{s))  = N~2(N  - n)a2(M0  + N)/{M0  + n).  (2.2.2) 

The  classical  estimator  of  7 (y)  is  Sc(s,y(s))  — y(s)  which  is  a design-unbiased 
estimator  of  7 (y)  under  simple  random  sampling.  Also,  Sc  is  a model-unbiased 
estimator  of  7 (y)  under  any  model  which  assumes  that  the  y^ s have  a common 
mean. 

To  derive  robust  Bayes  estimators  of  7 (y),  we  first  introduce  the  notion  of  e- 
contaminated  priors.  Denote  7r0  by  the  N(y0, Tq)  distribution.  The  class  T*  of  prior 
distributions  is  given  by 


T*  = {-7T  : 7r  = (1  - e)7r0  + eq,  q E Q},  (2.2.3) 

where  0 < e < 1 is  given,  and  Q is  the  class  of  all  distribution  functions.  We  denote 
m(y|7r)  by  the  marginal  (predictive)  density  of  y under  the  prior  n.  If  ir  G T*,  we 
can  write 

m(yk)  = (1  - e)m(y\ir 0)  + em(y\q). 

This  leads  to 

m(y(s) |7r)  = (1  - £)m(y(s)\n0)  + em(y(s)\q).  (2.2.4) 

Our  objective  is  to  choose  the  prior  7r  which  maximizes  m(y(s)\n)  over  T*.  This 
amounts  to  maximization  of  rn(y(s)\q)  over  q € Q.  Noting  that 


m 


/OO 
-00 


(27TCT2)  2 exp[—  - 0)2/(2cr2)]9(d#)> 


(2.2.5) 
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it  follows  that  m(y(s)\q)  is  maximized  with  respect  to  the  prior  which  is  degenerate 
at  y(s).  We  shall  denote  this  prior  q by  5y(s)i  $ being  the  dirac-delta  function.  The 
resulting  (estimated)  prior,  say  ns,  is  now  given  by 

TTs  = (1  - eKo  + etifa).  (2.2.6) 

The  prior  ns  is  called  ML-II  prior  by  Good  (1965),  and  Berger  and  Berliner  (1986). 
Using  the  prior  7rs,  y(.s)  is  marginally  distributed  as 

(1  — e)N  (/x0ln,  °2In  + Tq  J n)  + eFs,  (2.2.7) 

where  Fs  has  (improper)  pdf 

f(Vi\  i£s)  = (2? rcr2)~2  exp[-  ~ y(s))2/(2<r2)].  (2.2.8) 

We  now  prove  the  following  theorem,  which  provides  the  posterior  distribution  of  y(s) 
given  s and  y(s)  under  the  ML-II  prior  ns. 

Theorem  2.2.1  Under  the  ML-II  prior  ns,  the  conditional  distribution  of  y(s)  given  s 
and  y(s)  is 

A ml{v{s ))  iV({(l  — B0)y(s)  + B0y,0}lN_n , cr2(J^_n  + (M0  + n)_1  J u-n)) 

+(1  - AML(y(s)))  N(y(s)lN-n,  a2IN_n),  (2.2.9) 

where 

A ml(v(s ))  = 1+  e(l  - exp (nB0(y(s)  - //0)2/(2 a2)).  (2.2.10) 

Proof  of  Theorem  2.2.1  The  conditional  pdf  of  y(s)  given  s and  y(s)  is 

ns(y(s)\s,y(s))  = J f(y{s)\6)T?s{6\s,y(s))d6,  (2.2.11) 

where 


fs(0|s,y(s))  = A(j/(s))7ro(0|s,y(s))  + (1  - A(t/(s)))<f(0|s, y{s)), 


(2.2.12) 
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and 

Kv(s))  = (1  - £)m(y(s)\n0)/m(y(s)\ns).  (2.2.13) 

From  Ericson(1969),  7r0(y(s)|s,  y(s))  is  the  N[{(1  - B0)y(s)  + B0fi 0}lN-n, 

&2{lN-n  + (A/o  + n)  1 J jv— n)]  pdf.  Also, 

/ /(kmiwi*.  »(*))<»  = / f(v(mf(y(s)mmdo/m(y(sm 

= ™(y\q)/m(y(s)\q) 

= (2ira2)-^  exp[—  £(y«  - y(s))2/(2a2)](2.2.14) 

Further,  after  some  algebraic  simplifications, 

^-1(2/(s))  = 1 +e(l  ~ ^y1m{y{s)\q}/m(y(s)\n0) 

= A^L(y(s)).  (2.2.15) 

This  completes  the  proof  of  the  theorem. 

Note  that  under  the  posterior  distribution  given  in  (2.2.9),  the  Bayes  estimator  of 
7 (y)  is  given  by 

5RB(s,y(s))  = N-1[ny{s)  + (N-n){XML(y{8))({l  - B0)y{s)  + B0n0) 

+(1  - A ML(y(s)))y(s)}] 

= y(s)  - (1  - f)^ML(y{s))B0(y(s)  - Ho).  (2.2.16) 

Note  that  for  e close  to  zero  i.e.  when  one  is  very  confident  about  the  N(y 0,  Tq)  prior 
for  0,  since  A ml(?/(s))  is  close  to  1,  it  follows  that  5RB  is  very  close  to  6°.  On  the 
other  hand,  when  e is  close  to  1,  i.e.  there  is  very  little  confidence  in  the  iV(/xo,Tg) 
prior,  A ml{v{s))  is  close  to  zero,  and  5RB  is  very  close  to  the  sample  mean  y(s).  Thus, 
as  one  might  expect,  a robust  Bayes  estimator  serves  as  a compromise  between  the 
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subjective  Bayes  and  the  classical  estimators  of  7 (y).  Also,  generalizing  the  formula 
(1.8)  in  Berger  and  Berliner  (1986),  i.e., 

V"(x)  = \(x)V*°(x)  + (1  - \(x))V*(x)  + A(x)(l  - \{x)){6no(x)  - 59(a:))2, 
one  gets 
V(7(v)k»(«)) 

= N~2[(N  - ny  + (N-  n)~2 {o2 + A«l(1  - Xml)B20(§(s)  - W)2}]. 

m 0 + n 

(2.2.17) 

Next,  we  compare  the  performances  of  5°,  5C  and  SRB  from  the  robustness  per- 
spective. The  main  idea  is  that  we  want  to  examine  whether  these  estimators  perform 
satisfactorily  over  a broad  class  of  priors. 

With  this  end,  for  a given  prior  £,  denote  by  p(£,  (s,y(s)),a)  the  posterior  risk  of 
an  estimator  a(s,y(s))  of  y(y),  i.e.,  p(f,  (s,y(s)),a)  = E[{a(s,y(s))  - 7(y)}2|s,  ?/(«)]• 
The  following  definition  is  taken  from  Berger  (1984). 

Definition  2.2.1  An  estimator  ao(s,y(s))  is  ^-posterior  robust  with  respect  to  T if  for 
the  observed  (s,y(s)), 

PORr(a0)  = sup  |p(f , (s,  y(s)),  a0)  - inf  p(^,  (s,  y(s)),  a) | < C-  (2.2.18) 

We  shall,  henceforth,  refer  to  the  left  hand  side  of  (2.2.18)  as  the  posterior  robustness 
index  of  the  estimator  a0(s,y(s ))  of  7 (y)  under  the  class  of  priors  T.  PORr(a0)  in  a 
sense  is  the  sensitivity  index  of  the  estimator  a0  of  7 (y)  as  the  prior  varies  over  T.  For 
any  given  ( > 0,  it  is  very  clear  that  whether  or  not  posterior  robustness  exists  will 
often  depend  on  which  (s,y(s))  is  observed.  This  will  be  revealed  in  the  examples  to 
follow. 
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To  examine  the  posterior  robustness  of  5°,  5C  and  6RB,  consider  the  class  of 
Af(p,r2)  priors,  p (real)  and  r2(>  0).  Write  M = cr2/r2,  B = M/(M+n),  where 
<j2(>  0)  is  known.  Calculations  similar  to  (2.2.1)  now  give  the  Bayes  estimator  of 


7 (y)  under  the  N(p,  r2)  prior  (to  be  denoted  by  ^,b)  as 

^B(s,y(s))  = fy(s)  + (l-f){Bp+(l-B)y(s)} 

= y{»)  - (1  - f)B{y(s)  - p).  (2.2.19) 

Then  the  following  results  hold: 

p(&,B,  («,  y(«)),  S»'B)  = N~2(N  - n)<r2(M  + AT)/(M  + n);  (2.2.20) 

p(£/*,b,  (s,  y(s)),  <5°)  - (s,  y(*)),  <P’B) 

= (1  - /)2[5oCb  - Po)  + (Bo  - B)(y(s)  - p)]2;  (2.2.21) 

p(6*,b»  (s,  y(*)),  6C)  - p(Zh,b,  {s,  y{s)),  <P’B) 

- (1  -/)2B2(p(s)-p)2;  (2.2.22) 

P&.B,  (5,  1/00),  O - p(^,B,  (a,  y(*)),  ^B) 

= (1  - f)2[Bo\ML(y{s))(y(s)  - p0)  - B(y(s)  - p)]2.  (2.2.23) 


From  (2.2.21)  - (2.2.23)  it  is  very  clear  that  if  we  consider  the  class  T of  all  AT(p,  r2) 
priors,  for  each  one  of  the  estimators  5°,  6C  and  6RB , the  supremum  (over  p (real))  of 
the  left  hand  side  of  (2.2.21)  - (2.2.23)  becomes  +oo,  and  all  these  estimators  turn  out 
to  be  non-robust.  One  reason  why  this  happens  is  that  the  7V(p,  r2)  class  of  priors 
for  all  real  p and  r2(>  0)  is  indeed  too  big  to  be  practically  useful.  As  a next  step,  we 
consider  the  smaller  class  7V(p0,r2)  of  priors,  where  the  mean  p0  is  specified.  This  is 
not  too  unrealistic  since,  very  often  from  prior  experience,  one  can  make  a reasonable 
guess  at  the  center  of  the  distribution. 
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Note  that  when  y = //0,  denoting  ^0jB  by  and  5^°’B  by  5B,  (2.2.21)  - (2.2.23) 
simplify  to 

P(&>  ( s , y{s)),  6°)  - p(fB,  (s, !/(«)),  5B)  = (1  - /)2(£0  - B)2(y(s)  - y0 )2;  (2.2.24) 

p(£b>  (a,  y(a)),  ^C)  - p(£b,  (a,  !/(«)),  5B)  = (1  - f)2B2(y(s ) - //0)2;  (2.2.25) 

P(&>  («>  iM),  O - P(6b  (a,  l/(a)),  5B)  = (1  - f)2(B0\ML(y(s ))  - B)2(£(s)  - /x0)2- 

(2.2.26) 

Accordingly,  from  (2.2.24)  - (2.2.26), 

PORr(<S°)  = (1  - /)2  maxfBj,  (1  - B0)2](j/(s)  - /i„j2,  (2.2.27) 

PORr(ic)  = (1  - /)2(S(s)  - w)2,  (2.2.28) 

P0Rr(O  = (1  - If  max[B02A2M1(j/(S)),  (1  - B0AMJ,(y(s)))2](!/(s)  - «,)2.  (2.2.29) 

Thus,  given  any  C > 0 and  0 < / < 1,  the  posterior  (^-robustness  0f  all  these 
procedures  depends  on  the  closeness  of  the  sample  mean  to  the  prior  mean  y0.  Also,  it 
follows  from  (2.2.27)  - (2.2.29)  that  both  the  subjective  and  robust  Bayes  estimators 
are  more  posterior  robust  than  the  sample  mean  for  the  {N(y0,  t2),  t2  > 0}  class 
of  priors.  A comparison  between  (2.2.27)  and  (2.2.29)  also  reveals  that  the  robust 
Bayes  estimator  arrived  at  by  employing  the  type-II  ML  prior  enjoys  a greater  degree 
of  posterior  robustness  than  the  subjective  Bayes  estimator  if  BQXML(y(s))  > 1/2. 

Although  the  Bayesian  thinks  conditionally  on  (s,  y(s)),  and  accordingly  in  terms 
of  posterior  robustness,  it  is  certainly  possible  to  use  the  overall  Bayes  risk  in  defining  a 
suitable  robustness  criterion.  This  may  not  be  totally  unappealing  even  to  a Bayesian 
at  the  preexperimental  stage  when  he/she  perceives  that  y will  be  occurring  according 
to  a certain  marginal  distribution.  The  overall  performance  of  an  estimator  a(s,y(s)) 
of  7 (y)  when  the  prior  is  £ is  given  by  r(f,a)  = E[p(£,  (s,  y(s)),  a)],  the  expectation 
being  taken  with  respect  to  the  resulting  marginal  distribution  of  y(s).  The  following 
method  of  measuring  the  robustness  of  a procedure  is  given  in  Berger  (1984). 
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Definition  2.2.2  An  estimator  ao(s,  y(s ))  of  7 (y)  is  said  to  be  C-procedure  robust  with 
respect  to  T if 

PRr(a0)  = sup  |r(£,a0)  - inf  r(£,a)|  < C-  (2.2.30) 


We  shall  henceforth  refer  to  PRr(ao)  as  the  procedure  robustness  index  of  a0. 

Next  we  examine  how  the  three  estimators  5°,  8C  and  8RB  compare  according  to 
the  PR  criterion  as  given  in  (2.2.30)  when  we  consider  the  {iV(/i0, r2), t2  > 0}  class 
of  priors.  Using  the  same  notation  as  before  for  the  At(/x0,r2)  prior,  from  (2.2.24) 
- (2.2.26)  it  follows  that 


8°)  - r(&,  8B)  = (1  - f)2(B0  - B)2a2/(nB)-  (2.2.31) 

r{^B,  5C)  - r(£B,  8B ) = (1  - f)2Ba2/n;  (2.2.32) 

O ~ r(SB,  8B ) = (1  - f)2E[(B0\ML(y(s ))  - B)2(y(s)  - /i0)2]-  (2.2.33) 

It  is  clear  from  (2.2.31)  - (2.2.33)  that 


PRr(£°)  = +00, 

PRr(dc)  = (1  - /)V/n, 

PRr(0  = (l-/)2  sup  E[(B0XML(y(s))-B)2(y(s)-fi0)2]. 


(2.2.34) 

(2.2.35) 

(2.2.36) 


0<B<1 

From  (2.2.34)  and  (2.2.35)  it  is  clear  that  the  subjective  Bayes  estimator  lacks 
completely  in  procedure  robustness,  while  the  sample  mean  is  quite  procedure  robust. 
To  examine  the  procedure  robustness  of  8RB , we  proceed  as  follows. 


Theorem  2.2.2  For  every  e > 0,  E[(B0\ML(y(s))  - B)2(y(s)  - //0)2]  = Oe(Bi),  where 
Oe  denotes  the  exact  order. 
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Proof  of  Theorem  2.2.2  Noting  that  n(y(s)-p0)2  ~ ( a2/B)xl,  it  follows  from  (2.2.36) 
that 


E[(BqXMl (y(s))  - B)2(y(s ) - y0)2 
Bo 


roo 

=L  < 


a2  u u}/2~ 1 

1 4-  g exp(uB0/2B)  ~ B^^BU  exp^~  2)  2'/2T(\/2)du'  (2‘2‘37) 


where  g = (e/(l  — e))B0  ^2.  Next  observe  that 


rhs  of  (2.2.37) 


< 


a2  f°° 

n Jo  1 


B2 

Do 


{1  + g exp(uB0/2B)}2 


+ B2}B  XMexp(— ^) 


u . u 


1/2-1 


2;21/2T(l/2) 


du 


a2  B%  , r°°  ii 

5 it' fqB  l exP<-2 (1  + B°B^wmrndu + Bi 


3/2-1 


23/2r(3/2) 


n2g  1 

= 0{B1'2). 


(2.2.38) 


Again,  writing  g'—  max(g,  1), 


rhs  of  (2.2.37) 


> — f°°[ 

n Jo  |2o'  i 


B2 

no 


B~x  - 


2B. 


u v?!2  ^ 

+ B]  exp(--)—ln^,n  du 


{2g' exp{uB0/2B)}2  g exp(-uB0/2B)  ' "J^v  2'23/2T(3/2) 

= — [{B0/2g')2B-\l  + 2BoB-l)-*/2  - 2£0<T1(1  + B0B~ 1)~3/2  + B] 

71/ 

= 0(B1/2).  (2.2.39) 


Combining  (2.2.38)  and  (2.2.39)  the  result  follows. 


Thus,  unlike  the  subjective  Bayes  estimator,  the  robust  Bayes  estimator  using  the 
ML-II  prior  does  not  suffer  from  lack  of  procedure  robustness.  Also,  if  one  examines 
(2.2.32)  and  (2.2.33)  (i.e.,  before  taking  supremum  over  B),  then  using  Theorem  2.2.2 
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it  appears  that  the  classical  estimator  5C  has  a certain  edge  over  6RB  for  small  B. 
Smallness  of  B can  arise  in  two  ways.  One,  n is  very  large,  the  second,  r2  is  much 
larger  compared  to  a2.  This  is,  however,  natural  to  expect  since  small  B signifies 
small  variance  ratio  cr2/r2  which  amounts  to  instability  in  the  assessment  of  the  prior 
distribution  of  0.  It  is  not  surprising  that  in  such  a situation,  it  is  safer  to  use  y(s) 
in  estimating  7(2/)  if  one  is  seriously  concerned  about  long-run  performance  of  an 
estimator. 

Theorem  2.2.2  (specifically  the  upper  bound  derived  in  this  theorem  for  the  dif- 
ference of  Bayes  risks)  clearly  demonstrates  the  procedure  robustness  of  6RB . The 
superior  performance  of  the  POR  index  of  this  estimator  relative  to  the  POR  index 
of  5C  has  already  been  established.  On  the  other  hand  5°  which  performs  better  than 
6C  on  its  POR  index,  fails  miserably  in  its  procedure  robustness.  This  also  shows 
that  the  average  long-term  performance  of  a procedure  can  sometimes  be  highly  mis- 
leading when  used  as  a yardstick  for  measuring  its  performance  in  a given  situation. 
In  this  case,  5RB  seems  to  achieve  the  balance  between  a frequentist  and  a subjective 
Bayesian. 

In  practice,  however,  a2  is  not  usually  known.  In  such  a situation,  one  can  conceive 
of  an  inverse  gamma  prior  for  a2  independent  of  the  prior  for  6 to  derive  a Bayes 
estimator  of  7 (y)  (see  Ericson  (1969)).  In  a robust  Bayes  approach,  if  one  assumes  a 
mixture  of  a normal-gamma  prior  and  a type  -II  ML  prior  for  ( 9 , cr2),  then  the  type-II 
ML  prior  puts  its  mass  on  the  point  ( y(s ),  ^2ies(yi  — v{s))2 ln)-  If  is  possible  to  study 
the  robustness  of  the  resulting  Bayes  estimator  which  will  not  be  pursued  here. 

Next  we  consider  the  problem  of  finding  the  range  of  the  posterior  mean  of  7 (y)  = 
Eili  Vi  over  T*  in  (2.2.3).  Simple  modifications  of  the  arguments  of  Sivaganesan 
and  Berger  (1989)  or  Sivaganesan  (1988)  leads  to  the  following  theorem. 
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Theorem  2.2.3 


sup  E[j(y)\s,y(s)]  = fy(s ) + (1  - 

7rer« 


na6”°(y(s))  + euf(y(s)\6u) 

a + f{y(s)\6u) 


(2.2.40) 


and 


inf  E[y(y)\s,y(s)]  = fy(s)  + (1  - 

7itl  * 


aSn°(y(s))  + 6tf(y(s)\di) 

a + f{y(s)\0i) 


(2.2.41) 


where  a = e(l  -e)-1m(r/(s)|7r0),  6”°(y(s))  = (1  - B0)y(s)  + B0y0,  0t  = vta / yfti+y{s), 
8u  — vua/y/n  + y(s),  and  the  values  of  Vi  and  vu  are  the  solutions  in  v of  the  equation 


e v2/ 2 -cv2  -bv  + c = 0,  (2.2.42) 

where  c = a{2na2)n /2  exp [Eie4(j/i  - y{s))2 /(2a%  and  b = c^(y{s)  - 5*°(y(s)))/a. 

Note  that  the  equation  (2.2.42)  has  exactly  two  solutions  which  may  be  obtained 
by  rewriting  it  in  the  form  v = [— 2\og(cv2  + bv  — c)]1/2  and  solving  it  iteratively  using 
the  initial  values  ~b±Vb  +2(i+2c)(i+c)  ^ 


2.3  Symmetric  Unimodal  Contamination 


The  contamination  class  Q of  the  previous  section  contains  all  possible  distribu- 
tions. Needless  to  say,  such  a class  is  too  big  to  be  useful  in  practice  as  it  may  contain 
many  unwanted  distributions.  In  this  section,  we  consider  the  smaller  but  sufficiently 
rich  class  Q which  contains  all  unimodal  distributions  symmetric  about  /x0.  This  class 
was  considered  by  Berger  and  Berliner  (1986).  The  contamination  prior  pdf  tx  is  given 
by 

^{6)  = (1  - e)vr0(6>)  +eq{0),  (2.3.1) 

where  q E Q,  0 < e < 1,  and  as  before  txq{9)  is  the  N(/j,q,Tq ) pdf.  To  find  the 
ML-II  prior  in  this  case,  we  use  the  well-known  fact  that  any  symmetric  unimodal 
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distribution  is  a mixture  of  symmetric  uniform  distributions  (cf  Berger  and  Sellke 
(1987)).  Thus  it  suffices  to  restrict  q to  Q'  = { Gk  : Gk  is  uniform  (n0-k,n0+k),  k > 
0}.  The  ML-II  prior  is  then  given  by 


K(9)  = (!  “ £)MQ)  + e&(0),  (2.3.2) 

where  qs  is  uniform  (/.to  — k,/j,{ o + k),  k being  the  value  k which  maximizes  m(y(s)\q), 
the  marginal  pdf  of  y(s)  under  q. 

To  find  k,  note  that 


™(y(s)\q)  = I (2? rcr2)  5 exp[--^  £(&  - 0)2]g(d0) 


//to+fc  1 

(2tt<72)-  2 exp[-—  5Z(yi  - 0)2]d0 
.o-k  la  ies 

= (27ra2)-V(2A:V^)-1  exp[-^  g(^  - £(s))2] 
x [$(y/nk/a  - z(s))  - <&(-y/nk/a  - ^(s))], 


(2.3.3) 


where  $ is  the  N(0,  1)  cdf  and  z(s)  = y/n(y(s)  — Ho)/a.  The  solution  k is  given  by 
(cf  Berger  and  Selke  (1987),  pi  17) 

k = k*a/y/n  if  |z(s)|  > 1 

= 0 if  |z(s)|  < 1,  (2.3.4) 

where  k*  satisfies 


$(F  - z(s))  - $(-r  - z{s))  = k*[cf>{k*  - z(s))  + <t>(k * + z(s))j, 

(j)  being  the  N(0,  1)  pdf.  Since  both  sides  of  the  above  equation  are  symmetric  function 
of  z(s),  we  can  replace  z(s)  by  |z(s)|  to  get 

$(F  - \z(s)\)  - $(-**  - k(s)|)  = k*[cf>(k*  - |z(s)|)  + cj>(k * + |z(s)|)].  (2.3.5) 
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Clearly  k*  — 0 is  a solution  of  (2.3.5).  Berger  (1985,  p234)  states  without  proof 
that  there  exists  a unique  solution  &*(>  0)  of  (2.3.5)  for  |z(s)|  > 1.  Since  the  proof 
of  the  uniqueness  result  is  nontrivial,  and  is  very  critical  for  many  subsequent  devel- 
opments, we  provide  below  a proof  of  the  same. 

Lemma  2.3.1  There  exists  a unique  solution  k*  > |z(s)|  of  (2.3.5)  for  |z(s)|  > 1. 

Proof  of  Lemma  2.3.1  For  notational  simplicity,  write  k*  — x and  |z(s)|  = y.  Con- 
sider the  function 


It  suffices  to  show  that  for  every  y > 1,  there  exists  a unique  x > y for  which 
h(x,y)  — 0.  To  see  this,  first  observe  that 


Hence,  if  for  some  x0  > 0,  dh(x , y)/dx\x=Xo  > 0 (the  existence  of  x0  is  guaranteed  due 
to  dh(x,  y)/dx \x=y  = 2y2(t>(2y)  > 0 for  y > 0),  then  from  (2.3.7),  ( x0  - y)  exp(2 x0y)  + 
%o  + y > 0.  This  implies  that  for  every  xx  > x0,  {xx  - y)  exp(2 xxy)  + xx  + y > 0 
which  from  (2.3.7)  is  equivalent  to  dh(x,  y)/dx\x=Xl  > 0.  Thus,  for  a fixed  y,  once 
h(x,  y)  starts  increasing  at  x = x*  (say),  it  continues  to  do  so  from  x*  onwards  for 
that  y.  Moreover  h(0,y)  = 0 for  all  y , h{y,y)  < 0 for  all  y > 1,  and  h(+oo,y ) = 1 
for  all  y.  This  shows  that  for  a fixed  y > 1,  h(x,y)  starts  decreasing  as  a function  of 
x from  0 to  a negative  number,  and  then  starts  increasing  upto  1.  Since  h(y,y)  < 0, 
this  guarantees  the  existence  of  an  x > y for  which  h(x,  y)  = 0. 

Remark  2.3.1  Berger  (1985)  used  the  fact  k*  > |z(s)|  if  |z(s)|  > 1 in  order  to  provide 
an  iterative  procedure  for  finding  k*.  The  above  lemma  substantiates  Berger’s  claim 
in  addition  to  justifying  the  uniqueness  of  k* . 


h{x,  y)  = $(z  - y)  - $(-a:  - y)  - x[(j)(x  - y)  + (j)(x  + y)}.  (2.3.6) 


= x[(x  - y)(j)(x  - y)  + (x  + y)(j){x  + y)] 
= xcj)(x  + y)  [(x  - y)  exp(2 xy)  + x + y] . 


(2.3.7) 
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The  following  theorem  provides  the  conditional  distribution  of  y(s)  given  s and 
y(s)  under  the  ML-II  prior  n*.  Recall  M0  = <j2/t02,  B0  = M0/{M0  + n). 

Theorem  2.3.1  The  posterior  distribution  of  y(s)  (given  s and  y(s))  under  the  prior 

K is 

f(y{s)\s,y(s))  = A5{/(2(s))7r0(2/(s)|s,?/(s))  + (1  - Xsu(z(s)))m(y\qs)/m(y(s)\qs), 

(2.3.8) 

where  for  k > 0, 

A su(z(s ))  = 1 + e(\  - e)~lB0  1,/2(27r)1/2(l/2)[0(A:*  - z(s))  + (/>(k*  + z(s))] 

x exp(B0z2(.s)/2) 

= V (*(«))  (say),  (2.3.9) 

while  for  k = 0, 

SwM*))  = 

= Aj'(z(s))  (say).  (2.3.10) 

Proof  of  Theorem  2.3.1  Arguing  as  in  Theorem  2.2.1,  the  posterior  pdf  of  y(s ) given 
s and  y(s)  is 

Kiv(s)\s,y(s))  = Xy{s))no(y{s)\s,y(s))  + (l-\(y(s)))m(y\qs)/m(y(s)\qs),  (2.3.11) 
where 

A_1(y(s))  = 1 +e(l  - £)~1rn(y(s)\qs)/m(y(s)\n0).  (2.3.12) 

But  for  k > 0, 

m{y(s)\qs)/m(y(s)\TV0) 

= (2i)->B0-'/2£"j‘exp[-^j(«  - S(s))2]<»/ex pbf^ffeM  - Ho)a]. 


(2.3.13) 
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Recall  that  z(s)  — y/n{y{s)  — /io )/o-  Also,  using  (2.3.5),  and  the  fact  that  k = 
k*<j/y/n , 

rf-lo+k  nri 

L-t  exp|-2 

= (27T(T2/n)1/2[$(^(/io  + k-  v{s)) /a)  - $(^(/z0  - * - V{s))/a)] 

= (27 TO1  /n)l,2[${k*  - z(s ))  - $(-k*  - z(s))] 

= ( 2na2/n)1/2k*[(/)(k * - z(s ))  + <f>{k * + *(*))].  (2.3.14) 

Hence,  from  (2.3.13)  and  (2.3.14),  for  k > 0, 

™{y(s)\qs)/m(y(s)\'K0) 

= 2B°  1/2(2 7r)1/2[<A(A:*  - z(s))  + 4>{k*  4-  z(s))]  exp(B0z2(s)/2) 

= ^sviz(s)).  (2.3.15) 

Combine  (2.3.12)  and  (2.3.15)  to  get  (2.3.9). 

Next,  for  k = 0, 

m(y{s)\qs)/m(y(s)\ir0) 

= (2tt a2)~n/2  exp[-^  E iGt(Vi  ~ ^o)2] 

(27ra2)-"/2B,!/2exp[-^{E(s,(w  - /Jo)2  - n(  1 - B„)(S(s)  - «,)2}] 

= B^,/2exp[-(l  - B0)z2(s)/2].  (2.3.16) 

Combine  (2.3.12)  and  (2.3.16)  to  get  (2.3.10).  This  completes  the  proof  of  the  theo- 
rem. 

From  Theorem  2.3.1,  the  robust  Bayes  estimator  of  7 (y)  = A--1  yi  is 

£[7  (y)|«, !/(«)] 

= fy(s ) + A/"-1  XH'W(2:(S))[(1  - Bo)y(s ) + B0H0] 
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+d  - 

J m{y\q$)dy(s) 


(2.3.17) 


But  for  i e s and  k > 0, 

f Vtm(y\q,)dy(s) 


f m(y\qs)dy(s) 

= / f yi(2na2)~N/2  exp(— — ^ Ejlifa  - 6)2)qs{d0)dy{s) 

//(27r<72)-^exp(-^EiIi(!/i  - *m(«»)<fo(*) 


d ^exp(~2?  Si€.(»i  ~ Oftg.W 

dexp(“5^  - «)2)?*w 

CCexp(-^-sM)2w.w 

dex  p(-*(«  - sw)2)?.  w 

. C|(«  - SW)  exp(-^r(9  - 

= y(«)  + - 


= y(«)  - 

= y(«)  - 
= y(«)  - 


CtexP(-*(e-SM)2)?»W 

a 4>{k*  - z{s))  - (j){-k*  - z{s)) 
y/n  $(fc*  - z(s))  - $(-£*  - z(s)) 

a (j>(k*  — z(s))  - ^(— fc*  — z(s)) 

\/n  £*{</>(/;*  - z(s))  + </>(&*  4-  z(s))} 


g 


k*y/n 


tanh(/c*z(s)), 


(2.3.18) 


where  the  penultimate  equality  in  (2.3.18)  uses  (2.3.5). 
From  (2.3.17)  and  (2.3.18),  one  gets  for  k > 0, 


s‘  (*,  !/(«))  = y{s)  - (1  - f)[XiB0(y(s)  - /*„)  + (1  - A 0-7—p  tanh(r*(s))].  (2.3.19) 

rC  yy  Tl 

Similarly,  for  k = 0,  noting  that  tanh(A:*z(s))/A;*  ->  z(5)  as  A:*  -4  0,  one  gets  after 
some  simplifications 


Ssu(s,y(s))  = y(s)  - (1  - /)(!  - A2(l  - B0))(y(s)  - /x0). 


(2.3.20) 
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Also,  generalizing  the  formula  (1.8)  in  Berger  and  Berliner  and  after  some  heavy 
algebra,  one  gets  for  k > 0, 

V{i(y)\s,y{s)) 

= N-2l(N  - n)<72  + (AT  - „) V{-A_  + ifA  tanh(fz(*)) 

M0  + n k*n  " 

x(^)  - ^tanh {k*z{s)))}  -h  Ax(l  — Xi){B0(y(s)  - y0)  - tanh(A:*z(a))}2)] 

(2.3.21) 

while  for  k = 0, 

vh(y)\s,y(s)) 

■ = N-2[(N  - n)a2  + (N-  n)2{a2j^-  + A2(l  - A2)(l  - B„)2(ff(s)  - ^J2)]. 

(2.3.22) 

Next  we  study  the  posterior  and  procedure  robustness  of  the  robust  Bayes  estima- 
tors proposed  in  this  section.  For  posterior  robustness,  calculations  similar  to  those 
of  the  previous  section  provide  for  k > 0, 

p(Sb»  (s>  y(«)),  tsu)  ~ p(£b,  (s,  y(s)),  SB ) 

= 0 - f)2[{B0X\  - B){y(s)  - no)  + (1  - Xi)~^=  tanh(fc*z(s))]2, (2.3.23) 

while  for  k = 0, 

P((b,  (»,  »(»)),  «st')  - p(fu,  («, »(»)),  <SB) 

= (1  - /)2[(1  - B)  - A,(l  - B0)]2(S(s)  - Mo)2.  (2.3.24) 

Accordingly,  from  (2.3.23)  and  (2.3.24),  for  k > 0, 
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PORr  (Ssu) 

= (1  ~ /)2  m&x[{B0\x(y(s)  - no)  + (1  - Ai)-  ^ tanh (k*z(s))}2, 

K yj  Tl 

{(B0Ai  - l)(y(s)  - Ho)  + (1  - Aa)  ^ tanh(fc*z(s))}2],  (2.3.25) 

AC  y/Tl 

while  for  k = 0, 

PORr (Ssu)  = (1  - /)2max[(l  - B0)2\l  {1  - A2(l  - B0)}2](£(s)  - //0)2.  (2.3.26) 

In  order  to  study  the  behavior  of  the  difference  in  the  Bayes  risks  of  6SU  and  6B 
under  the  subjective  7V(yu0,r2)  prior,  we  need  the  following  lemma  which  leads  to  the 
asymptotic  relationship  between  k*  and  |z(s)|  as  |z(s)|  ->•  oo. 

Lemma  2.3.2  (i)  For  large  |z(s)|,  say  |^(s)|  > M0(>  0),  there  exists  c0(0  < Co  < 1) 
such  that  co/k*  < (j)(k*  - |z(s)|)  + (j){k * + \z(s)\)  < 1/A:*; 

(ii)  k*  — |z(s)|  — >•  +oo  as  |2(s)|  — » +oo; 

(iii)  for  |z(s)|  > M\,  0 < k*  — |^r(s) | < c*(log  A;*)1/2  for  some  c*(>  0). 

Proof  of  Lemma  2.3.2  (i)  From  (2.3.5), 

<f>{k*-\z(s)\)  + <Kk*  + \z(s)\)  = [$(**  - |z(s)|)  - $(-A:*  - \z(s)\)]/k* 

< 1/A;*.  (2.3.27) 

Further,  for  |z(s)|  > M0  > 0, 

- \z(s)\)  - *(-k*  - \z(s)\)  = $(k*  + \z(s)\)  - <S>(\z(s)\  - k*) 

> *(|z(a)|)-*(0) 

> $(M0)  - i = c0  (say).  (2.3.28) 

Hence,  combining  (2.3.27)  and  (2.3.28)  one  gets  (i). 

(ii)  From  (i), 

l^imJ^A;*  - \z(s) |)  + 4>(k * + |*(*)|)]  = 0 
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since  k*  > |z(s)|  — > oo  as  |z(s)|  -*  oo. 
Hence, 


lim 

|z(s)|-s-oo 


- |z(s)|)  =0, 


that  is, 

|JSoo(27r)_1/2eXpt-(^  _ lz(s)l)2/2]  = 0. 


This  implies 

lim  (k*  - |z(s)|)2  = +oo 

|z(s)|-KX> 


which  implies  (ii)  since  k*  > |z(s)|  for  |z(s)|  > 1. 
(iii)  Using  (2.3.5)  and  (ii), 


lim 

|z(s)|->00 


k*(t>{k*  - |z(s)|)  = 1. 


This  is  equivalent  to 


0 = lim  (log(r/x/2?)  - (k-  - |z(s)|)2/2] 


which  implies 


( k * - z(s ))2  = 21og(/c*/v/27r)  + o(l)  as  |z(s)|  ->•  oo. 

Consequently, 


k*~\z(s)\  = [2  \og(k*/y/2n)  + o(l)]1/2 

= (21og^)1/2(i+0(i))  as  |z(s)|  — > oo. 

Hence,  there  exists  Mx  such  that  for  |z(s)|  > Mx, 

k*  — |2(s)|  < c*(logfc*)1/2  for  some  c*  > 0. 


This  completes  the  proof  of  the  lemma. 
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In  order  to  study  the  procedure  robustness  of  8SU , first  find  under  the  N(/j,0,t2) 
prior  (denoted  by  £B) 

r(&,0-r(fr,*B) 

- »(«))- ^(«,  y(a))]a 

= (1  - /)2S[{A1S0(y(s)  - Mo)  + (1  - X\)  kt°^~  tanh(fc*g(s))}/^>0] 

+ (1  - A2(l  - B0))(y(s)  - Mo )/pf=OJ  - B(y(s)  - Mo)]2-  (2.3.29) 

We  shall  now  prove  the  following  theorem. 

Theorem  2.3.2  r(fB,  8SU ) - r(fB,  SB ) = 0(B 1/2). 

Proof  of  Theorem  2.3.2  First  use  the  inequality 
rhs  of  (2.3.29) 

< 3(1  - f)2E[{XlB0(y(s ) - Mo)  + (1  - Ai)^^=  tanh(k*  z(s))}2/^>0] 

+(1  - A2(l  - B0))2{y(s ) - Mo )2I(k=0]  + B2(y(s)  - Mo)2]-  (2.3.30) 

Next  observe  that 

E[B2(y(s)  - mo)2]  = E[B2a2(nB)~lx2]  = Ba2/n  = Oe{B);  (2.3.31) 


£[(1  - A2(l  - B0))2(y(s)  - Mo)2/£=0]] 

= £[(1  - A2(l  - B0))2(y(s)  - Mo)2/[22(«)<i]] 

_ Kjr/1  ^ ^0  i2  & 2t  l 

U 1 +9  exp(— (1  - B0)x2/(2B))  ' 7UiXl  M<b ]J 

(g  = e(l+e)-lBo1/2) 

<^-E\U+ C1  ~ Bp)2 2 

- nB  (1  + <?exp(-(l  - B0)x2/(2B)))^XlIM<BV 
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(1  - B0) 


~ nij  1 + 2g exp(— (1  - B0)x?/(2£))) 

^ | (!  ~ -Bo)2exp((1  - Bq)!2)  2 

— nB ^ 2 g jB(Xi  *{xl<B])-  (2.3.32) 


Note  that 


£(x?fea])  = [ {xexp{-x/2)(x/2)1'2  1 /(2T(l/2))}dx 

J 0 

= j\'!2dx  = (2.3.33) 


Combine  (2.3.32)  and  (2.3.33)  to  get 


Ihs  of  (2.3.32)  = 0(B1/2). 


Next  use  the  inequality 


£[{Xi-B0(y(s)  ~ Mo)  + (1  ~ Ai)^^tanh(A;*2:(s))}2/[^>0]] 

< 2^[{A 2xB2{y(s)  ~ Mo)2  + (1  - A1)V(A:*V^)'2tanh2(fe^(s))}/[1f>0]]. 

(2.3.35) 

Now,  writing  g'  = g\/2n, 

E{%B2(y(*)  ~ #<o)2/MJ 

= EfcBl(o2/n)z2(s)Iww>1]\ 

<E\ B$(a2/n)z2(s) 

1 + g’ exp(B0z2(s)/2)((i)(k*  - |z(s)|)  + (j)(k * + |z(s)|))  [*2(s)>1]J' 

(2.3.36) 

Let  K = max(M0,  Mi,  2).  Then  writing  g"  = cog'  and  using  (i)  of  Lemma  2.3.2, 


(2.3.34) 
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rhs  of  (2.3.36) 

< £[B0V/n)*2M{/, .<,’(.)«’]  + j‘+  exp(B0z2(s) /2) j_ V(*)>*=] }] 

— /n)E[B  xXi  I[b<x\<k2b] 

+ k*z2{s)/{k*  + g”  exp(B0z2{s)/2))I[z2(s>K2]}.  (2.3.37) 


But, 


rK2B 

E[x\ I[B<x2<K*B]\  = / {xexp(-a:/2)a:1/2_1/v/27r}^ 

< ^B3/2(K3-  1)/V2tt.  (2.3.38) 

Also,  using  (iii)  of  Lemma  2.3.2,  and  log  k*/k*  < 1 for  |z(s)|  > K 

E[k*z2(s){k*  + g"  exp(B0z2(s)/2)}~1I[z2is)>K2]\ 

< E[z2(s){z{s)  +ct(\ogk*)l'2){k*  + g"  exp(B0z2(s)/2)}-1I[z2{s)>K2]] 

< E[{\z(s)\3/(g"exp(B0z2(s)/2))}I[z2{s)>Ka] 

+c*z2(s)(2^"exp(B0z2{s)/4))-1r[z2{s)>K2]\.  (2.3.39) 


E[\z(s)\3exp(-B0z2(s)/2)I[Z2{S)>K2]] 

= E{(x\/Bf2  ^M-\EqxVE)I]x2>K2B]] 

roo  1 1 

= B'3/2  Lbx3/2qM~2BBoX  ~ 2X)xX/2~l^)~X/2dx 

< (27r )-1/2^-3/2  f a:eXp(— + 1 ))dx 
Jo  2 B 

= (27t)-1/2B-3/24(§  + l)-2  = 0(BX'2). 

B 


(2.3.40) 
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Moreover, 


E\z2(s)  exp(-^z2(s))/[z2{,)>^2]] 

= E\{x\/B)  exp(-^x?)/[x2>/<2B]] 

=B'7«“sexp(-f(S+i^,/2wv^ 

= 0(B1^2). 

Combine  (2.3.37)  - (2.3.41)  to  conclude  that 

E[\\Bl(${s)  - /x0)2/M]  = 0{BW). 

Finally, 


£[(i  - M)s 


a 


(, k*)2n 


tanh  2(fc*z(s))/^>0]] 


< ~Ei(k*)  2 tanh2(A:tz(s))/[|j.(a),>i]] 

2 

- — E[z2{s)I[x<\z(s)\<k]  + (k*)~2 1[\Z(S)\>K]\, 


where  in  the  final  inequality  of  (2.3.43),  we  use  | tanh(Ar*s(s))|  < jfc*|;z(s)| 
|z(s)|  < K and  | tanh(fc*z(s))|  < 1 for  |^r(s)|  > K. 

As  before 

*[2!W'[K|.M|<lol  = 0(B1/2)- 

Also,  using  (ii)  of  Lemma  2.3.2, 


E[(**)-aJiW.)|>*]] 

< £[(Z2(s))-,/|»I„)>Ki]]  = E[B(Xf)-'/w>K,B]] 

roo  1 

= B Jk2bX~1  eXP(~  2X)‘I'1/2_1(^7r)_1^2^'r 


(2.3.41) 

(2.3.42) 


(2.3.43) 
for  1 < 

(2.3.44) 
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/ r°° 

< B(2n)~l/2  J^^x~3/2dx  = 2B(2tt)~1/2(K2 B)-1/2 
= 0(BV2).  (2.3.45) 

From  (2.3.43)  - (2.3.45), 

lhs  of  (2.3.43)  = 0(B1/2). 

Combine  (2.3.30),  (2.3.31),  (2.3.32),  (2.3.34),  (2.3.35),  (2.3.42),  (2.3.43)  and  (2.3.45) 
to  get  the  theorem. 

Remark  2.3.2  It  follows  from  the  above  theorem  that  as  n -4  oo,  i.e.  B — > 0,  under 
the  subjective  N(y0,r2)  prior,  8SU , the  robust  Bayes  estimator  of  7 (y)  = TV-1  Vi 
is  asymptotically  optimal  in  the  sense  of  Robbins  (1955). 

Next,  to  find  the  range  of  E[y(y) |s,  ?/($)]  over  T*  in  (2.3.1),  once  again  applying 
Sivaganesan  and  Berger  (1989)  or  Sivaganesan  (1988),  we  get  the  following  theorem. 

Theorem  2.3.3 

sup  E['y(y)\s,  y(s )]  = fy(s ) + (1  - /)—  (2.3.46) 

Trer.  a + H(zu)  v ’ 

and 

inf  E[y(y)\s,y(s)]  = fy(s ) + (1  - f)?S  ^^  + Hi(zt)  (2.3.47) 

xer*  a + H{zi)  v ' 

where 

, 1 rno+z 

H{z)  = — f(y(s)\6)d9  ifz^O 

LZ  J hq—z 

= f(y{s)\no)  if  2 = 0;  (2.3.48) 

1 rno+z 

Hi  (z)  = — ef(y(s)\6)d$  if  2^0 

ZZ  J^lq—z 

= t*of(v(s)\no)  if  2 = 0,  (2.3.49) 
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while  the  values  of  zi  and  zu  are  given  by  the  solutions  of  the  equation 

„ _ (y(s)v  - uoj y/n)(wt  + 2 aa/y/n)  - v(2a0a/s/n  + wtuo) 
z — 

2 [auz  + t(an0  — a0 ) + wuv/2 ] 


(2.3.50) 


where  t = “),  u = „ = 


' a/y/n  / 


' c/y/n 


),  and  w = exp [— ^2  Eies{yi  - v{s)) 


21  (2irc 


2)-(n-l)/2 

v/n 


Note  that  the  equation  (2.3.50)  may  be  iteratively  solved  for  z by  taking  a number 
larger  than  ^“(^(s))  as  the  initial  value  of  z when  maximizing,  and  a number  smaller 
than  Sno(y(s))  as  the  initial  value  of  z when  minimizing. 


2.4  An  Example 


This  section  concerns  the  analysis  of  real  data  set  from  Fortune  magazine,  May 
1975  and  May  1976  to  illustrate  the  methods  suggested  in  Sections  2.3  and  2.4.  The 
data  set  consists  of  331  corporations  in  US  with  1975  gross  sales,  in  billions,  between 
one-half  billion  and  ten  billion  dollars.  For  the  complete  finite  population,  we  find  the 
population  mean  to  be  1.7283  and  the  population  variance  2.2788.  The  population 
variance  is  assumed  to  be  known  for  us.  We  select  10%  simple  random  sample  without 
replacement  from  this  finite  population.  So  the  sample  size  is  n=33.  We  can  obtain 
easily  the  sample  mean  and  the  corresponding  standard  error.  We  use  gross  sales 
in  previous  year  as  prior  information  to  elicit  the  base  prior  n0.  The  elicited  prior 
7To  is  the  N(1.6614,  6.4351  xlO-3)  distribution.  Under  this  elicited  prior  7To,  we  use 
formulas  (2.2.1)  and  (2.2.2)  to  obtain  the  subjective  Bayes  estimate  and  the  associated 
posterior  variance.  But  we  have  some  uncertainty  in  7T0  and  the  prior  information, 
so  we  choose  e = .1  in  T*  and  we  get  the  robust  Bayes  estimates  and  the  associated 
posterior  variances  using  the  formulas  (2.2.16),  (2.2.17),  (2.3.19)  and  (2.3.21).  A 
number  of  samples  were  tried  and  we  have  reported  our  analysis  for  one  sample.  Table 
2.1  provides  the  classical  estimate  5C,  the  subjective  Bayes  estimate  5°,  the  robust 
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Table  2.1.  Estimates,  Associated  Standard  Errors  and  Posterior  Robustness  Index 


Estimate 

SE 

1 7(2/)  - 6\ 

POR 

5C 

1.9881 

0.2852 

0.2598 

8.6506  xl0~2 

6° 

1.7191 

0.0105 

9.2187xl0-3 

7.2386  xlO-2 

6RB 

1.7704 

0.1459 

4.2075  xl0~2 

4.7416xl0-2 

Ssu 

1.7313 

0.1184 

3.0239xl0-3 

6.5948xl0-2 

Bayes  estimate  6RB  with  all  possible  contaminations,  the  robust  Bayes  estimate  Ssu 
with  all  symmetric  unimodal  contaminations  and  the  respective  associated  standard 
errors.  Table  2.1  also  provides  the  posterior  robustness  index  for  each  estimate  which 
is  in  a sense  the  sensitivity  index  of  the  estimate  as  the  prior  varies  over  T. 

From  Table  2.1,  we  may  find  that  the  robust  Bayes  estimates  5SU  and  5RB  are  well 
behaved  in  the  sense  that  they  are  closer  to  7 (y)  than  at  least  the  classical  estimate 
Sc  and  Ssu  is  closer  to  7 (y)  than  even  the  subjective  Bayes  estimate  d°.  The  robust 
Bayes  estimates  Ssu  and  5RB  are  also  good  in  the  viewpoint  of  the  posterior  robustness 
index. 

For  finding  the  range  of  the  posterior  mean  of  7 (y)  = N -1  y{  over  T*,  we  get 

that  the  posterior  mean  is  in  the  interval  (1.7120,  1.7868)  for  all  possible  contamina- 
tion case  and  in  the  interval  (1.7166,  1.7406)  for  all  symmetric  unimodal  contamina- 
tion case.  Observe  that  the  second  interval  is  much  shorter  than  the  preceding  one. 
Also  note  that  robust  inference  is  achieved  for  T*  in  either  case.  So  if  we  feel  that  the 
true  prior  is  close  to  a specific  one,  say  7To,  we  should  model  through  T*  using  one  or 
the  other  contamination  model  gaining  robustness  in  each  case  as  compared  to  the 
use  of  a subjective  case. 


CHAPTER  3 

ROBUST  BAYES  COMPETITORS  OF  THE  RATIO  ESTIMATOR 

3.1  Introduction 

For  most  sample  surveys,  for  every  unit  i in  the  finite  population,  information  is 
available  for  one  or  more  auxiliary  characteristics,  characteristics  other  than  the  one 
of  direct  interest.  For  example,  if  the  characteristic  of  direct  interest  is  the  yield  of 
a particular  crop,  the  auxiliary  characteristic  could  be  the  area  devoted  that  crop  by 
different  farms  in  the  list.  We  consider  the  simplest  situation  when  for  every  unit  i 
in  the  population,  value  of  a certain  auxiliary  characteristic,  say  £*(>  0)  is  known  (i 
= 1,2,...,  N). 

The  classical  estimator  of  the  finite  population  mean  Y-1  Ui  in  such  cases 
is  the  ratio  estimator  eR  = N~l  {Y^iesVi/  T,iesxi)  which  seems  to  incorporate 

the  auxiliary  information  in  a very  natural  manner.  Moreover,  this  estimator  can 
be  justified  both  from  the  model  and  design  based  approach.  While  Cochran  (1977) 
provides  many  design-based  properties  of  the  ratio  estimator,  Royall  (1970,  1971) 
justifies  this  estimator  based  on  certain  superpopulation  models. 

The  ratio  estimator  can  also  be  justified  from  a Bayesian  viewpoint.  To  see  this, 
consider  the  superpopulation  model  ?/,  = /3xi+ei , where  et-  are  independent  N( 0,  cr2Xj), 
i = 1, 2, . . . , N,  while  (3  is  uniform  over  (-oo,  oo).  In  the  above,  a2(>  0)  may  or  may 
not  be  known.  For  unknown  cr2,  one  assigns  an  independent  prior  distribution  to  a2. 
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Then  based  on  s and  y(s),  the  posterior  (conditional)  mean  of  7 (y)  = N~l  Vi  is 
given  by  eR. 

It  is  clear  from  the  above  that  the  ratio  estimator  can  possibly  be  improved  upon 
when  one  has  more  precise  prior  information  about  0.  For  example,  if  one  wants  to 
pi  edict  the  mean  yield  of  a certain  crop  based  on  a sample  of  farms,  it  is  conceivable, 
on  the  basis  of  past  experience,  to  specify  a prior  distribution  for  0 with  fairly  accurate 
mean  0O  and  variance  r02.  Ericson  (1969)  has  indeed  shown  that  with  a N(0 „,  r02)  prior 
to  0,  the  Bayes  predictor  eE  of  7 (y)  is  given  by 

eE{s,y(s))  = N-'[J2vi  + {(o’-2  Vi  + ro20o)/(a~2  + r0-2)}  ^x,].  (3.1.1) 

i€s  ies  i£s 

Clearly  eE  converges  to  eR  as  Tq  — > 00,  that  is,  when  the  prior  information  is  vague. 

The  above  Bayesian  approach  has  been  frequently  criticized  on  the  ground  that  it 
preassumes  an  ability  to  completely  and  accurately  quantify  subjective  information 
in  terms  of  a single  prior  distribution.  We  shall  see  in  the  next  section  that  failure 
to  specify  accurately  one  or  more  of  the  parameters  0q  and  Tq  can  have  a serious 
consequence  when  calculating  the  Bayes  risk  and  often  protection  is  needed  against  the 
possibility  of  such  occurrence.  A robust  Bayesian  viewpoint  assumes  that  subjective 
information  can  be  quantified  only  in  terms  of  a class  T of  possible  distributions. 
Inferences  and  decisions  should  be  relatively  insensitive  to  deviations  as  the  prior 
distribution  varies  over  T. 

In  this  chapter,  we  consider  an  e-contamination  class  of  priors  for  0 following  the 
lines  of  Berger  and  Berliner  (1986).  In  Section  3.2,  the  e-contamination  class  includes 
all  possible  distributions.  For  every  member  within  this  class,  the  Bayes  predictor 
(posterior  mean)  of  7 (y)  is  obtained,  and  expressions  for  variations  of  the  posterior 
mean  within  this  class  of  priors  are  given  following  Sivaganesan  and  Berger  (1989) 
and  Sivaganesan  (1988).  Moreover,  the  ML-II  prior  (see  Good  (1965)  or  Berger  and 
Berliner  (1986))  within  this  contamination  class  of  priors  is  found.  We  provide  also 
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analytical  expressions  for  the  indices  of  posterior  and  procedure  robustness  (cf  Berger 
(1984))  of  the  proposed  robust  Bayes  predictors  based  on  ML-II  priors  for  an  entire 
class  of  priors,  and  have  compared  these  indices  with  similar  indices  found  for  the  usual 
ratio  estimator  as  well  as  the  subjective  Bayes  predictor.  In  the  course  of  calculating 
indices  for  procedure  robustness,  we  have  proved  the  asymptotic  optimality  in  the 
sense  of  Robbins  (1955)  of  the  robust  Bayes  predictor,  and  have  also  pointed  out  that 
the  subjective  Bayes  predictor  does  not  possess  the  asymptotic  optimality  property. 

The  above  program  is  repeated  in  Section  3.3  with  the  exception  that  the  contam- 
ination class  now  contains  only  all  symmetric  unimodal  distributions.  Once  again, 
robust  Bayes  predictors  are  found,  and  their  optimality  is  studied. 

Finally,  in  Section  3.4,  a numerical  example  is  provided  to  illustrate  the  results  of 
the  preceding  sections. 

The  summary  of  our  findings  is  that  the  robust  Bayes  predictors  enjoy  both  pos- 
terior and  procedure  robustness  for  a fairly  general  class  of  priors.  The  subjective 
Bayes  predictor  enjoys  posterior  robustness,  but  fails  miserably  under  the  criterion 
of  procedure  robustness.  The  classical  ratio  estimator  is  inferior  to  the  others  under 
the  criterion  of  posterior  robustness,  but  enjoys  certain  amount  of  procedure  robust- 
ness: Thus,  our  recommendation  is  to  give  the  robust  Bayes  predictors  every  serious 
consideration  if  one  is  concerned  with  both  Bayesian  and  frequentist  robustness.  The 
other  important  finding  is  that  when  the  sampling  fraction  goes  to  zero,  that  is,  we 
are  essentially  back  to  an  infinite  population,  several  asymptotic  optimality  proper- 
ties of  the  estimators  of  Berger  and  Berliner  (1986)  are  established  which  are  hitherto 
unknown  in  the  literature. 

Royall  and  Pfferman  (1982)  have  addressed  a different  robustness  issue.  They 

have  shown  that  under  the  superpopulation  models  (i)  yi  ~d  N(a  + (3xi,a2Xi)  and 

(ii)  yi  ~d  N(/3xi,a2Xi),  if  /?  is  uniform  ( — oo,  oo),  then  under  balanced  samples,  that 
is,  x(s)  = x(s),  the  posterior  distribution  of  y(s)  given  {s,y(s))  is  the  same  under 
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the  two  models,  the  resulting  estimator  of  the  finite  population  mean  being  the  ratio 
estimator  which  equals  the  sample  mean  in  this  case. 

For  simplicity,  we  shall  assume  that  n(s)  ^ n =$■  p(s)  = 0,  that  is,  we  effec- 
tively consider  only  samples  of  fixed  size  n.  Also,  for  notational  simplicity,  we  shall, 
henceforth,  assume  that  s = {«i,  •••,*„}  where  1 < ix  < • • • < in  < N.  Let 
* = {l,2,  •••,  N}  - s = {ii , • • • , jN-n}  (say),  where  1 < j\  < ■■■  < jw_n  < N. 
We  shall  write  y(s)  = {yh,  ■ - • , yin)T,  y(s ) = (yhl  • • • , yjN_n)T,  x{s ) = (Xil,  ■ ■ ■ ,xin)T, 
D(s)  — Diag(xtl , • • • , Xin ),  x(s)  = (xj1,  • • • , XjN_n)T , and  D(s ) = Diag(xjj , • • • , XjN_n). 
Note  that  writing  M0  = a2/r$,  B0  = B0(s ) = M0(A/0  + nx(s))-1  and  / = n/N  (the 
sampling  fraction),  the  Bayes  predictor  e#  given  in  (3.1.1)  can  be  alternately  written 
as 

cb0(s,  y(s))  = fy{s ) + (1  - /)x(s)[(l  - B0(s))y(s)/x(s)  + B0{s)/30].  (3.1.2) 

In  later  sections,  we  shall  repeatedly  use  (3.1.2).  Also,  the  associated  posterior  vari- 
ance is  given  by 

V(7(y)\s,y(s))  = cr2  N~2[(N  - n)x(s)  + (N  - n)2x2(s)/(M0  + nx(s))].  (3.1.3) 

3.2  g-Contamination  Model  and  the  ML-II  Prior 

Suppose  that  conditional  on  (3 , yt  are  independent  N((3xi,cr2Xi),  i — 1,2, 

Also,  let  (3  have  the  prior  distribution  n = (1  o+eq,  where  e G [0, 1)  is  known,  7 r0 
is  the  N((30,Tq)  distribution,  and  q G Q,  the  class  of  all  possible  distributions.  Then 
the  marginal  pdf  of  y(s)  is  given  by 

m(y(s)\n)  = (1  - e)m(y(s)|7r0)  + em(y(s)\q),  (3.2.1) 

where  m(y(«)|7T0)  denotes  the  N(/30x(s),  a2D(s)  + t$x(s)xt(s))  pdf,  while 


m{y{s)\q)  = f (2ttct2)  2)exp[--L  - f3xi)2/xi}q{d(3). 

ies  Za  ies 


(3.2.2) 
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The  posterior  pdf  of  @ given  (s,  y(s))  is  now 

tt(/?|s,  y(s))  = \(y(s))no{0\s,  y(s ))  + (1  - \(y(s)))q(P\s,  y(s)),  (3.2.3) 

where 

A(s/(-s))  = (1  - £)m(j/(s)|7r0)/m(?/(s)|7r),  (3.2.4) 

7r0(/?|.s,2/(s))  denotes  the  7V((1  - B0(s))y(*)/*(«)  + B0(s)A),  er2(M0  + mr(s))-1)  pdf, 
and 

q(/3\s,y(s))  = /(y(s)|/?)g(/?)/m(y(s)|g).  (3.2.5) 

This  leads  to  the  posterior  pdf  of  y(s)  given  (s,y(s))  as 

7r(y(s)k,3/(s))  = J f(y(s)\p)n(p\s,y(s))dp 

= Hy(s))n0(y(s)\s,y(s))  + (l  - \(y(s)))q(y(s)\s,  y(s))(3.2.6) 

where  7r0(y(*)|s,  y(s))  is  the  iV(((l  - B0(s))y(s)/x(s)  + B0(s)p0)x(s),a2(D(s) 

+ (Mo  + nx(s))~1x(s)xT(s)))  pdf  while  using  (3.2.5), 

$(y(*)l*>  y(s ))  = m{y\q)/™(y(s)\q)-  (3.2.7) 

Then  the  posterior  mean  of  7 (y)  is  given  by 


£[7(v)l«.y(s)] 

= fy(s)  + (1  - f)x(s) 


(1  - £)m(y(s)\TT0)67To(y(s))  + g f Pf(y(s)\p)q(dp) 
(1  - e)m(y(s) |ir0)  + em(y(s)\q) 


(3.2.8) 


where 


^°(y(s))  = (1  - B0(s))y(s)/x(s)  + B0(s)p0 ■ (3.2.9) 

From  Sivaganesan  and  Berger  (1989)  or  Sivaganesan  (1988),  it  follows  that 


sup(inf)£[7(y)|s,j/(s)]  = fy(s)  + (1  - f)x(s) 

7rgr 

x finfl  l1  - £)m(y(s)\n0)6*°{y(s))  + ePf(y(s)\p) 
P (l-e)m(i/(s)|7ro)  + e/(y(s)|0) 


(3.2.10) 
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Hence,  following  Sivaganesan  (1988),  we  get 


sup  E[y(y)\s,  j/(s)] 

7r€r 


= fy(s)  + (l  - f)x(s)— 


- £)m(tj(s)\7r0)5*°(y(s))  + ePufjyjs) \/3v) 
(1  - £)m(y(s)\Tr0)  + Ef{y(s)\pu) 


while  infw6r  E['y(y)\s,y(s)]  has  a similar  expression  as  (3.2.11)  with  /3L  replacing  /3V. 
In  the  above  flu  and  pi  (fiL  < fjjj)  are  given  by 


Pv  = y{s)/x(s)  + V(/a(nx(.s))  1/2,  (3L  = y(s)/x(s)  + vLa(nx(s))~1/2 
where  Vu  and  vL  (<  vv)  are  solutions  in  v of  the  equation 

e~v2/2  - c(u2  - 1)  - bv  = 0,  (3.2.12) 

where  c = a(2-Ka2)n/2  {Ui€s  x]1 2)  exp[^  £*,(&  - x^s)  / x{s))2  / Xi],  b = c{nx{s))1'2 
(y(s)/x(s)  - 6n°(y(s)))/a,  and  a = e(l  - e)-1m(?/(s)|7ro).  We  shall  use  (3.2.11)  in 
Section  3.4  for  numerical  evaluation  of  the  supremum  and  infimum  of  the  posterior 
mean  under  the  given  class  of  priors. 

Next  we  find  the  ML-II  prior  within  the  given  class  of  priors.  Since  — 

fixiflxi  is  minimized  with  respect  to  p at  ps  = y(s)/x(s),  from  (3.2.1)  and  (3.2.2), 
the  ML-II  prior  which  maximizes  the  marginal  likelihood  ra(y(s)|7r)  with  respect  to 
q € Q is  given  by 

kml(P)  = (1  - e)7r0(/?)  + eqs(P),  (3.2.13) 

where  qs(p ) is  degenerate  at  p — /3S.  The  posterior  pdf  of  y(s)  given  by  (s,  y(s)) 
under  the  ML-II  prior  j tml  is  now  given  by 

KML(y(s)\s,y(s)) 

= ^ ML{y(s ))  7r0(y(s)|s,  y(s))  + (1  - Xml(V(s)))  N((y(s)/x(s))x(s),  a2D[s)) 


(3.2.14) 
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where  for  0 < e < 1,  after  some  algebraic  simplifications 
Vi(y(«)) 

= 1 +e(l  ~ £)~lm(y(s)\qs)/m(y(s)\n0) 

= 1 + e(l  - ^)_1B0  2(s)exp(nB0(s)(y(s)  - /50^(s))2/(2cr2^(s)))-  (3.2.15) 
The  robust  Bayes  predictor  of  7 (y)  under  the  ML-II  prior  7tml  then  simplifies  to 

eRB(s,  y(«))  = /y(«)  + (1  - f)x(s){(  1 - AML(y(s))£o(s))y(s)/z(s) 

+AMz,(y(s))B0(s)/?o}-  (3.2.16) 

Also,  generalizing  the  formula  (1.8)  in  Berger  and  Berliner  (1986),  one  gets  the  asso- 
ciated posterior  variance  given  by 

V(7(l/)|si !/(«)) 

. = iV"2[er2(iV  - n)x(S)  + (N  - n)2x2(s){a2-  ^ML_  + AML(1  - \ML) 

Mo  + nx{s) 

xBo(s){V(s)/x(s)  - /?0)2}].  (3.2.17) 

We  shall  now  compare  the  robust  Bayes  predictor  eR#  of  7 (y)  with  the  Bayes 
predictor  es0  given  in  (3.1.2)  and  the  ratio  estimator  e r in  terms  of  posterior  risks 
as  well  as  the  Bayes  risks  under  the  {N(/30,  r2),  r2  > 0}  class  of  priors.  For  a typical 
member  N((30,t2)  of  this  class,  the  Bayes  predictor  of  7 (y)  is  given  by 

eB(s,  y(s ))  = fy(s ) + (1  - f)x(S)[{  1 - B{s))y(s)/x(s)  + B{s)(30 ],  (3.2.18) 

where  B = B(s)  = M/(M  + nx(s)),  M - o^/r2. 

The  choice  of  the  above  class  of  priors  may  be  justified  as  follows.  Very  often, 
based  on  prior  elicitation,  one  can  take  a fairly  accurate  guess  at  the  prior  mean. 
However,  the  same  need  not  necessarily  be  true  for  the  prior  variance,  where  there  is 
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a greater  chance  of  vagueness.  Note  that  when  r2  ± Tq,  none  of  the  estimators  eR, 
6s0  or  £rb  is  the  optimal  (Bayes)  estimator. 

Based  on  Definition  2.2.1  introduced  in  Chapter  2,  we  examine  the  posterior  ro- 
bustness indices  of  eBB,  eBo  and  eR.  Note  that  whether  or  not  posterior  robustness 
exits  will  often  depend  on  which  (s,  y(s))  is  observed.  This  will  be  revealed  in  the 
subsequent  calculations. 

With  this  end,  first  note  that  under  the  7V(/?0,r2)  prior  denoted  by  £T2,  the  pos- 
terior risk  of  any  estimator  e of  7 (y)  is 

p(6t-2,  (■ s,y(s)),e ) = p(£T2,  (s,  y(s)),eB)  + (e  - eB)2  (3.2.19) 

where  eB  is  given  in  (3.2.18).  Using  Definition  2.2.1  and  (3.2.19)  one  gets  for  the  class 
T = {£t2  : t2  > 0}  of  priors 

PORr(e«)  = sup  (1  - f)2x2(s)B(s)2[y(s)/x(s)  - (30]2 

0<B(s)<l 

= (1  ~ f)2x2(s)[y{s)/x{s)  - /30]2]  (3.2.20) 

PORr(eB0)  = sup  (1  - f)2x2(s)(B(s)  - B0(s))2[y(s)/x(s)  - /?0]2 

0<B(s)<l 

= (1  - f)2x2[s)  max[R0(5)2,  (1  - B0{s ))2] 

x[y{s)/x(s)  — /5o]2;  (3.2.21) 

PORr(efiB)  = sup  (1  - f)'2x2(s)(\ML(y(s))B0(s)  - B(s))2[y(s)/x(s)  - (30 ]2 

0<B(s)<l 

= (1  - f)2x2{s)  max[\2ML(y(s))BQ(s)2 , (1  - \ML{y(s))B0(s))2] 

x[y(s)/x(s)  - po}2.  (3.2.22) 

Note  from  (3.2.20)  - (3.2.22)  that  if  we  allow  all  possible  distributions  N((3,  r2),  where 
P is  widely  different  from  /30  as  our  priors,  all  POR  indices  can  become  prohibitively 
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large.  It  follows  from  (3.2.20)  - (3.2.22)  that  both  eBo  and  eRB  are  superior  to  e a in 
terms  of  posterior  robustness.  However,  the  ratio 

PORr(eB0)/PORr(efiB) 

= max[B2,  (1  - B0)2]/  max[A 2ML{y{s))B2,  (1  - AML(y(s))£0)2]  (3.2.23) 

can  take  values  both  larger  and  smaller  than  1 depending  on  the  particular  (s,y(s)). 

Although  the  Bayesian  thinks  conditionally  on  (s,  y(s)),  it  seems  quite  sensible  to 
use  the  overall  Bayes  risk  as  a suitable  robustness  criteria,  at  least  at  a preexperi- 
mental  stage.  This  issue  is  also  addressed  in  Berger  (1984)  who  introduced  also  the 
criterion  of  procedure  robustness.  We  use  Definition  2.3.2  to  study  the  procedure 
robustness  indices  of  eRB,  eBo  and  eR. 

Simple  calculations  yield  for  the  class  T = {£T2  : r2  > 0}  of  priors 

PHr(eB)  = sup  (1  - f)2x2{s)a2{nx{s))~l  B(s) 

0<J9(s)<l 

= (1  - /)2x2(s)er2(nx(s))-1;  (3.2.24) 

PRr(eSo)  = sup  (1  - f)2x2(s)a2(nx(s))~l(B0(s)  - B(s))2/B(s) 

0<B(s)<l 

= +oo;  (3.2.25) 

PRr(eBB)  = sup  (1  - f)2x2(s)E[(B0(s)\ML(y(s))  - B(s))2(y(s)/x(s)  - fo)2}. 

0<S(s)<l 

(3.2.26) 

It  is  thus  clear  that  the  subjective  Bayes  predictor  eBo  lacks  procedure  robustness, 
while  the  ratio  estimator  eR  is  quite  procedure  robust.  The  procedure  robustness  of 
eRB  can  be  examined  on  the  basis  of  the  following  theorem. 

Theorem  3.2.1  E[(B0(s)XML{y{s))-B(s))2(y(s)/x(s)-/30)2]  = Oe(B1/2(s)),  for  every 


£ > 0,  where  Oe  denotes  the  exact  order. 
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Proof  of  Theorem  3.2.1  Noting  that  nx{s)(y(s)/x(s)  - $> )2  ~ {a2/B)xj,  it  follows 
from  (3.2.26)  that 


E[{B0XML(y(s))/x(s))  - B)2(y(s)  - /30)2] 
B0  - 2 


Bq  a u v}/2~1 

~ Jo  U+gexp{uB0/2B)  ~ nx{s)BU eXP^~ 2 ) 2V2r(l /2) dM’(3-2'27) 

where  g = (e/(l  - e))^1/2.  Next  observe  that 


rhs  of  (3.2.27) 


< 


a r°° 


roc 

l <- 


Bl 


v 

+ B2}B~1uex p(--)-,,^^ 


< 


nx(s)  Jo  {1  + gexp(uB0/2B)}2  ' J 2y2x/2r(l/2) 

(72  fg02  D_J  /“*>  7/  . 7,3/2— 1 

nx(s) 


<7 


mr(s) L 2 c/ 

= 0(J5 1/2). 

Again,  writing  g'=  max(g,  1), 
rhs  of  (3.2.27) 

'2  «»  Bl 


[TgB  L exp(-2(1  + B»B“‘))53Af(375)<i”  + B) 
[|iB-1(l  + BoB_1)'3/2  + B] 


(3.2.28) 


> 


a 


r oo 

/o  ^12^ 


B 


-l 


2£fi 


nx(s)  J o {2t/'exp(uB0/2B)}2  gexp(-uB0/2B) 


+ B } 


U 7i3/2-1 

xexp(-^)  - 


2;  23/2r(3/2) 

[(B0/2^)2S-1(1  + 2B0B-l)-V2  - 2B0g~\l  + B0B~^I2  + B\ 

(3.2.29) 


nx(s) 

= 0(fi1/2). 

Combining  (3.2.28)  and  (3.2.29)  the  result  follows. 


When  / — » 0,  we  get  a result  related  to  the  procedure  robustness  of  the  robust 
Bayes  procedure  of  Berger  and  Berliner  (1986).  To  our  knowledge,  such  a result  is 
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the  first  of  its  kind.  In  addition  as  n ->■  oo,  that  is,  B ->  0,  it  shows  that  the 
robust  Bayes  procedure  is  asymptotically  optimal.  In  view  of  (3.2.24)  and  (3.2.26) 
and  Theorem  3.2.1  (i.e.,  before  taking  supremum  over  B)  it  appears  that  e r has 
distinct  advantage  over  e##  for  small  B.  This  is  not  surprising  though  since  small  B 
signifies  small  M = <r2/r2  which  amounts  to  greater  instability  in  the  assessment  of 
the  prior  distribution  of  (3  relative  to  the  superpopulation  model.  It  is  not  surprising 
that  in  such  circumstances,  it  is  safer  to  use  e#  for  estimating  7 (y)  if  one  is  seriously 
concerned  about  the  long-run  performance  of  the  estimator. 

A different  point  to  note  is  that  in  reality,  a2  is  not  usually  known.  In  such 
situations,  however,  one  can  conceive  of  an  inverse  gamma  prior  for  a2  independent 
of  the  prior  for  f3  to  derive  a Bayes  estimator  of  7 (y).  In  a robust  Bayes  approach, 
if  one  assumes  a mixture  of  a normal-gamma  prior  for  (/?,  a2),  then  the  ML-II  prior 
puts  its  mass  on  the  point  (y(s)/x(s),  n~l  EiesiVi  ~ Xiy(s)/x(s))2 /xi).  We  have  not, 
however,  pursued  here  the  resulting  Bayesian  analysis. 

It  should  be  noted  though  that  the  class  Q contains  many  priors  which  should 
possibly  not  come  under  consideration.  In  the  following  section,  instead  of  Q , we 
consider  the  <3*  which  contains  only  symmetric  unimodal  distributions. 

3.3  Symmetric  Unimodal  Contamination 

Suppose  now  the  contamination  prior  distribution  is  given  by 

*(0)  = (1  - AM/?)  + eq(P),  (3.3.1) 

where  q e Q*,  the  class  of  all  possible  symmetric  unimodal  distributions,  and  7To  is 
as  in  Section  3.2.  The  expression  for  E[y(y)\s,y(s)]  is  the  same  as  the  one  given 
in  (3.2.8).  Using  the  fact  that  any  symmetric  unimodal  distribution  is  a mixture  of 
symmetric  uniform  distributions  as  in  Sivaganesan  and  Berger  (1989),  one  gets 

sup(inf)£[7(y)|3,s/(s)]  = fy(s)  + (1  - f)x(s) 
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v -nnfinfl il  ~ £)m(^(5)l7ro)^°(^(g))  + eH^k) 

* ’ (l-e)m(y(s)\n0)  + eH(k) 


(3.3.2) 


where  m(j/(s)|7r0)  and  S7T°(y(s))  are  given  by  (3.2.1)  and  (3.2.9)  respectively,  and 


1 rpo+k 

H(k)  = - j _ f(y{s)\P)d/3  lf*/0 


2k  J0o-k 

= f{y(s) \p0) 


if  k = 0; 


(3.3.3) 


i rPo+K 

HAk)  = uJy0-k  0/foOOI/W  if  MO 

= A>/(l/(s)|#>)  if  A:  = 0. 


(3.3.4) 


Using  the  expression  /(j/(s)|/?)  = (2na2)~*  (ILe.  5)exp[-^  E^fa-fo)2/**],  it 
follows  after  some  simple  algebra  that 


H(k)  = exp[-—Y,(Vi- Xiy(s)/X(s))2/Xi\(2k)  '(Zna2)  nil  (Hxi  2) 


l£s 


iGs 


cr(nx(s))-1/2 


cr(nx(s))-1/2 


and 


ffi(A) 


H(k)y(s)/x(s)  + exp[--^  £)(%  ~ ®il/(s)/*(s))2/*t] 

/cr  ies 

x(2/c)_1(27rcr2)  ~ (IX  2)^{nx(s))~l 


k^-y{s)/x{s)  _ /30-k-y{s)/x{s ) 

a(nx(s))-ll2  ' ' cr(nz(s))-1/2 


(3.3.6) 


Write  * = z(s)  = ( y(s)/x(s)-po)(nx(s)y/2/a , k0  = k(nx{s)y/2 / a,  A = 0(fco-z(s))  + 
(j)(-k0  - z(s)),  « = (f>(k0  - z{s ))  - (j)(-k0  - z(s)),  vu  = $(k0  - 2(5))  - $(-fc0  - *(«)), 
a — £_1(1  — e)m.(j/(s) |7r0)  and  = aSir°(y(s)).  Now,  using  (3.3.2)  - (3.3.6)  and  solving 
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Jr[(ai  + Hi{k))/(a  + H{k ))]  = 0,  it  follows  after  some  heavy  algebra  that 

sup  E[y(y)\s,  y(s)]  = fy{s ) + (1  - f)x(s)(ax  + Hx(ku))/(a  + H(kv))  (3.3.7) 

Trgr 


and 


™lEh(y)\s,y(s)]  = fy(s ) + (1  - f)x(s)(ax  + Hx(kL))/(a  + H(kL ))  (3.3.8) 

where  ky  and  ki{<  ku)  are  the  two  solutions  of  the  quadratic  equation 

2 k[auk  + t(aP0  - aa)  + Guv/2] 

= [ wy(s)/x(s ) - au(nx(s))~1/2][Gt  4-  2 ao(nx(s))~l/2} 

- w(2aia(nx(s))~1/2  + G0ot),  (3.3.9) 

where 

G = expf-^  _ xiy(s)/x(s))2/rri](27ra2)-zl7i(na:r^)(nx(s))-1/2.  (3.3.10) 

The  formulas  (3.3.7)  - (3.3.10)  will  be  utilized  in  Section  3.4  for  numerical  computa- 
tions. 

Next  we  find  the  ML-II  prior  in  this  case.  Since  any  symmetric  unimodal  distri- 
bution is  a mixture  of  symmetric  uniform  distributions,  for  finding  the  ML-II  prior, 
it  suffices  to  restrict  oneself  to  Q't  = {Gk  : Gk  is  uniform  (/30 -k,(30  + k),k  > 0}.  The 
ML-II  prior  is  then  given  by 


7r*  = (1  - e)n0  + eqs, 


(3.3.11) 


where  qs  is  uniform  (/?o  — k,/3o  + k ),  k being  the  value  k which  maximizes  m(y(s)\q). 
To  find  k , first  write 


m(y(s)\q) 


(2 k)  / (2yra2)  2(JJ^2)exp[- 

Jfo~k  its 


2a2 


~ /3xi)2 / Xi\dl3 

its 
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= (27ra2)  (2k\J nx(s))  *)  exP[~^2  E(&  ~ «<y(»)/*(«))2/a:,-] 

i6s  ZCr  tes 

x [$(yjnx(s)k/a  - z(s ))  - <$>{-s]nx(s)k/a  - z(s))],  (3.3.12) 

where  z(s)  = ( nx{s)f/2(y{s)/x{s ) - /30)/a.  Differentiating  m(y(s)|g)  with  respect  to 
k and  letting  the  derivative  equal  to  zero,  it  follows  from  Berger  and  Sellke  (1987) 
that 

k = k*a/\Jnx{s)  if  |z(s)|  > 1 

= 0 if  |z(s)|  < 1,  (3.3.13) 

where  k*  is  a solution  of  the  equation 

$(**  ~ 1*001)  - H~k*  - |z(s)|)  = k*[(f)(k*  - |z(s)|)  + <f>(k * + |z(s)|)].  (3.3.14) 

Remark  3.3.1  Clearly  k*  = 0 is  a solution  of  (3.3.14).  Berger  (1985,  p234)  points  out 
that  there  exists  a unique  solution  k*(>  0)  of  (3.3.14)  for  |z(s)|  > 1.  Lemma  2.3.1 
contains  the  stronger  result  than  that  there  exists  a unique  solution  k*  > |z(s)|  of 
(3.3.14)  for  |z(s)|  > 1. 

Under  the  ML-II  prior  n*,  the  posterior  distribution  of  y(s ) given  (s,y(s))  is 

Ki.y(s)\s,y(s))  = \su(z(s))n0(y(s)\s,y(s))  + (1  - *su(z(s)))m(y\qs)/m(y(s)\qs), 

(3.3.15) 

where  for  k > 0, 

%£(*(*))  = l + e(l-e)“1(27r/Bo)1/2(V2)[0(^-^(s))  + (/)(r  + z(S))] 

x exp(B0z2(s)/2) 

= V(*00)  (say).  (3.3.16) 

while  for  k = 0, 


^ su(z(s ))  — 1+^(1— e)  exP(~(l  ~ B0)z2(s)/2) 

= ^2  0*00)  (say). 


(3.3.17) 
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The  robust  Bayes  predictor  of  7 (y)  under  the  ML-II  prior  n*  is  then  given  by 
esu(s,y(s)) 

= fy(s ) + (1  - f)x(s)[XMs)){(l  - B0(s))y(s)/x(s)  + B0(s)p0} 

+ (1  - Ai(z(s))){?j(s)/x(s) j==  tanh(A:*z(s))}]  (3.3.18) 

k*Jnx{s ) 

for  k > 0,  while  for  k = 0, 


esu(s,y(s )) 

= M»)  + (1  - f)x(s)[X2(z(s)){(l  - B0(a))y{8)/x(s)  + B0(s)p0} 

+(1  - A2(z(s)))A,].  (3.3.19) 

Also-,  generalizing  the  formula  (1.8)  in  Berger  and  Berliner  and  after  some  heavy 
algebra,  one  gets  the  associated  posterior  variance  given  by 


v(y  (y)\s,y(s)) 

= N~2[a2(N  - n)x(s)  + (N  - n)2x2(s) 


x{a2( 
+ Ai(l 


M„  + U(8)  + 75(7F  ta.nh(fc-^(S))(*(s)  - 1 tanh(***(»))))} 

- Ai ){B0(s)(y(s)/x(s)  - p0) ~~  tanh(A:*z(s))}2]  (3.3.20) 

k*Jnx(s ) 


for  k > 0,  while  for  k = 0, 


Vhiy)\s,y{s)) 

= N~2[a2(N  - n)x(s ) + (AT  - n)2x2(s){a2--  ^2  — + A2(l  - A2) 

M0  + nx(s) 

x (1  - B0(s))2(y(s)/x(s)  - po)2}}.  (3.3.21) 

Next  we  provide  expressions  for  the  indices  of  posterior  robustness  of  the  robust 
Bayes  predictors  proposed  in  this  section  under  the  {N(P0,  r2),  r2  > 0}  class  of  priors. 
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Calculations  similar  to  those  of  the  previous  section  provide  for  k > 0, 

P0Rr(e5t/) 

— (1  — f)2%2(s)  max[{B0(.s)\i(y(s)/x(s)  — p0)  + (1  — Ai) — tanh(fc*z(g))}2, 

k*Jnx(s) 

. {(Bo(s)\\  - 1 )(y(s)/x{s)  - po)  + (1  - Ai) ° — tanh(fc*z(s))}2],  (3.3.22) 

k*Jnx(s) 

while  for  k = 0, 

P0Rr(e5f/)  = (1  ~ffx2{s)  max[(l  - B0(s))2X2 , {1  - A2(l  - B0(s))}2](y(s)/x{s) -p0)2- 

(3.3.23) 

In  order  to  examine  the  procedure  robustness  of  esu,  first  note  that  under  the 
N(Po,t2)  prior  (denoted  by  £t2) 

r(£r2?  ®5l/)  r(Cr2>  Cb) 

= E[esu(s,  y{s ))  - eB(s,  y(s))]2 

= (t  - f)2x2{s)E[{XlB0(s)(y(s)/x{s)  - p0)  + (1  - Ai) _ tanh(k*z(s))}I^  , 

k*Jnx(s ) 1 J 

+ (1  - A2(l  - B0(s)))(y(s)/x(s)  - Po)I$=0]  - B(s)(y(s)/x(s)  - A>)]2.  (3.3.24) 

We  now  have  the  following  theorem. 

Theorem  3.3.1  r(£T2,  esu)  - r(£T 2,  eB)  = 0(B1!2). 

Proof  of  Theorem  3.3.1  First  use  the  inequality 
rhs  of  (3.3.24) 

< 3(1  - f)2x2(s)E[{X1B0(y(s)/x(s)  - p0)  + (1  - Ai) ° — tanh {k* z(s))}2 1$  Q] 

k*Jnx(s ) 1 J 

+(1  - A2(l  - B„))2(y(s)/i(s)  - A)2/[M  + B2(§(s)/x(s)  - ft,)2]. 


(3.3.25) 
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Next  observe  that 


E[B2{y(s)/x{s)  - A,)2]  = E[B2a2(nx{s)B)~lx\\  = Ba2/(nx{s))  = Oe{B)  (3.3.26) 
and 

m - Aa(l  - B0))2(y(s)/x(s)  - Po)2r[t=0]] 

= E[(  1 - A2(l  - B0))2(y(s)/x(s)  - P0)2I[z 2(,)<1]] 

__  1 Bp ,2  ^ 2 t i 

1 + #exp(-(l  - B0)xi/(2B))^  nx(s)BXl 

(. 9 = e(l  +e)_1i5o_1/2) 


< 


< 


^ c[[]  | (1  - Bo)2 2 

nx(s)B  Ll  (1  + gexp(— (1  - B0)xj/ (2B)))2  }Xl 

a e\[  1 -i (1  ~ Bp)2 „ 

nx(5)B  Ll  2^exp(-(l -B0)a?/(2B)))/Xi 

0 + (1- ^^((1-^/2)  (3.3. 27) 


nx(s)B 


Note  that 


E(.Xi 7[x?<b])  = / {xexp(-x/2)(a;/2)1/2  1/(2r(l/2))}drrr 

« 0 

= (27T)-1/2  [B  xx!2dx  = (27t)-1/2(|b3/2).  (3.3.28) 

v 0 o 

Combine  (3.3.27)  and  (3.3.28)  to  get 


lhs  of  (3.3.27)  = 0{B1/2). 


(3.3.29) 


Next  use  the  inequality 


E[{XiB0(y{s)/x(s)  - fa)  + (1  - \i)- 


o 


:tanh (k'zis))}2!,?  J 


< 


k*^Jnx(s)  V >n  [fc>0^J 

2El{^B20(y(s)/x(s)  - Ao)2  + (1  - X\)2a2(k*\Jnx(s))~2  tanh2(fc*2:(s))}7[^>0]]. 


(3.3.30) 
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Now,  writing  g'  = gy/ 2n, 


mBl(y(s)/x(s)  - A, )%0]] 

= #o  (^V  nx(s))22(s)/[z2(s)>1]] 


< ^,r  B$(a2 /nx(s))z2(s) 

“ L1  +g'exp(Boz2(s)/2)((t)(k*  - |z(s)|)  + <j>{k*  + k(s)|))/[*2(*)>lll3‘3’31) 
Let  K = max(M0,  Mi,  2).  Then  writing  g"  = c0g'  and  using  (i)  of  Lemma  2.3.2, 


rhs  of  (3.3.31) 

< £[B0V/-W).2M{ WW<K!]  + -+a.atp(^(.)/2)jfV,„>,1H 

< S2(a2/nx(s))T;[B-1x?/[B<x2<if2B] 

+ k*z2(s)/ (k*  + p"  exp(Bo-22(s)/2))/[z2(5)>^2]].  (3.3.32) 


e[x2J{b<x\<k*b}]  = / {xexp(-a;/2)x1/2_1/V/27r}^ 

J B 

< ^B3'2(K3  - l)/y/2n.  (3.3.33) 

Also,  using  (iii)  of  Lemma  2.3.2,  and  \ogk*/k*  < 1 for  |z(s)|  > K 

E[k*z2(s){k*  + g"  exp(B0z2  (s) /2)}~l  /[z2(s)>k-2]] 

— (s)(2(s)  + c*(l°g A;*)1/2){A:*  + </' exp(B0z2(s)/2)}  1 ^{z2(s)>k2]] 

< E[{\z(s) \3/{g"  exp(B0z2(s)/2))}I[z2{s)>K2] 

+ctz2(s){2i/g"exp(B0z2(s)/A))-1Im>K2]].  (3.3.34) 

But, 


E[\z(s)\3exp(-B0z2(s)/2)I[Z2{S)>K2]] 
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E[{X\/B)  ! exP(  2^°Ai/^K[x?>a'2b]] 

pOO  1 1 

= B_3/2  L*3'2  exp(~wB°X  ~ 2x^xl/2~1^~1/2dx 

< (2tt)-1/2B~3/2  ( a:exp(-io:(:^  4-  1 ))dx 
Jo  ZB 

= (2tt)-1/2B-3/24(§  + l)-2  = 0{B1!2). 

B 


Moreover, 


E[z2(s)  exp(-^-z2(s))I[z2{s)>K2]] 

= E[(xVB)exp(-~x21)I[x2>K2B]} 

= S_1  Cb  6Xp(“f + i))*172^)"172^ 

= 0(B1'2). 

Combine  (3.3.34)  - (3.3.36)  to  conclude  that 

E[A?Bj(S(s)/i(s)  - A)2/ff>0]]  = 0(B‘/2). 

Finally, 


£[(1-Ai)2 


< 


a 


< 


nx{s) 

T^_ 

nx(s) 


77  -77-37  : tanh2(/c*z(s))/r-r  nll 
( k*)2nx(s ) v v " [fc>°lJ 

•E[(/c*)-2tanh2(A;^(s))7[|z(s)|>1]] 
■^(*)W«|<*]  + (^*)~2%k(s)|>.R:]], 


(3.3.35) 


(3.3.36) 


(3.3.37) 


(3.3.38) 


where  in  the  final  inequality  of  (3.3.38),  we  use  | tanh(fc*z(s))|  < k*\z(s)\  for  1 < 
|z(s)|  < K and  | tanh(fc*z(s))|  < 1 for  |z(s)|  > K. 
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As  before 

E[z2(s)I[  i<|*(4)|<*]]  = 0(Bl/2).  (3.3.39) 

Also,  using  (ii)  of  Lemma  2.3.2, 

E[(*:*)-2/|k(,)l>if]] 

< b[(^(»))->V(.)>KS)]  = £[B(x;)-Vm>k,b]] 

roc  i 

= B x~l  exp(--rr)x1/2_1(27r)_1/2c?x 
Jk2b  2 v ' 

n 00 

< B^n)-1'2  JK2BX-3'2dx  = 2B(2tt)-1/2(K2 B)~1/2 

= 0(B1'2).  (3.3.40) 

From  (3.3.38)  - (3.3.40), 

lhs  of  (3.3.38)  = 0{B1/2) 

Combine(3.3.25),  (3.3.26),  (3.3.27),  (3.3.29),  (3.3.30),  (3.3.37),  (3.3.38),  (3.3.39)  and 
(3.3.40)  to  get  the  theorem. 

Remark  3.3.2  It  follows  from  the  above  theorem  that  as  n — » oo,  i.e.  B(s)  -4  0,  under 
the  subjective  N(fi0,  r2)  prior,  e$u,  the  robust  Bayes  estimator  of  y(y)  = N~l  Ya=\  Vi 
is  asymptotically  optimal  in  the  sense  of  Robbins  (1955). 

3.4  An  Example 

The  example  in  this  section  considers  one  of  the  six  real  populations  which  are  used 
in  Royall  and  Cumberland  (1981)  for  an  empirical  study  of  the  ratio  estimator  and 
estimates  of  its  variance.  Our  population  consists  of  the  1960  and  1970  population,  in 
millions,  of  125  US  cities  with  1960  population  between  100,000  and  1,000,000.  Here 
the  auxiliary  information  is  the  1960  population.  The  populations  of  different  cities 
are  shown  in  Figure  3.1. 
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The  problem  is  to  estimate  the  mean  (or  total)  number  of  inhabitants  in  those 
125  cities  in  1970.  For  the  complete  population  in  1970,  we  find  that  the  population 
mean  is  0.29034.  We  select  20%  simple  random  sample  without  replacement  from  this 
population.  So  the  sample  size  is  n=25.  Also,  we  are  using  a 2 = (N  — l)-1  J2iLi  (Vi  ~ 
0Xi)2  = 4.84844xl0-3  which  is  assumed  to  be  known.  We  can  obtain  easily  the  ratio 
estimate  and  corresponding  standard  error.  To  do  a Bayesian  analysis,  we  use  both 
1950  and  1960  populations  in  125  cities  to  elicit  the  base  prior  7 r0  for  (3.  The  elicited 
prior  7r0  is  the  N(l. 15932,  1.21097x  10~3)  distribution  based  on  prior  information. 
Under  this  elicited  prior  7To,  we  use  formulas  (3.1.2)  and  (3.1.3)  to  obtain  the  subjective 
Bayes  predictor  and  the  associated  posterior  variance.  But  we  have  some  uncertainty 
in  7Tq  and  the  prior  information,  so  we  choose  e = .1  and  we  get  the  robust  Bayes 
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Table  3.1.  Predictors,  Associated  Standard  Errors  and  Posterior  Robustness  Index 


Predictor 

SE 

It (y)  -e\ 

POR 

eR 

0.28426 

1.10032xl0-2 

6.08452xl0-3 

5.38660xl0-4 

eB0 

0.29336 

5.61481  xl0~3 

3.01418xl0-3 

3.27488xl0~4 

£rb 

0.28660 

5.49854xl0~3 

3.74880  xlO-3 

4.35696  xlO”4 

esu 

0.29027 

5.74954xl0-3 

7.83722  xl0~5 

3.29777xl0-4 

predictors  and  the  associated  posterior  variances  using  formulas  (3.2.16),  (3.2.17), 
(3.3.18)  and  (3.3.20).  For  illustration  purpose,  we  have  decided  to  report  our  analysis 
for  one  sample.  Table  3.1  provides  the  classical  ratio  estimate  eR,  the  subjective 
Bayes  predictor  eBo,  the  robust  Bayes  predictor  eRB  with  all  possible  contaminations, 
the  robust  Bayes  predictor  esu  with  all  symmetric  unimodal  contaminations  and  the 
respective  associated  standard  errors.  Table  3.1  also  provides  the  posterior  robustness 
index  for  each  predictor  which  in  a sense  the  sensitivity  index  of  the  predictor  as  the 
prior  varies  over  the  class  {N(/30,  r2),  r2  > 0}. 

An  inspection  of  Table  3.1  reveals  that  the  robust  Bayes  predictors  eRB  and  esu 
are  well  behaved  in  the  sense  that  eSu  is  closest  to  7 (y)  and  both  eRB  and  eSu  are 
closer  to  7 (y)  than  the  classical  ratio  estimate  eR  . The  subjective  Bayes  predictor 
eBo  is  good  in  the  sense  of  the  posterior  robustness  index.  Note  that  eR  is  worst  in 
both  the  closeness  to  7 (y)  and  the  posterior  robustness  index. 

Also  we  find  that  the  posterior  mean  of  7 (y)  = N~l  ZiLi  Vi  is  m the  interval 
(0.28363,  0.29428)  for  all  possible  contaminations  and  in  the  interval  (.29003,  .29364) 
for  all  symmetric  unimodal  contaminations.  Note  that  the  range  of  the  posterior 
mean  of  7 (y)  = N~l  Vi  is  fairly  small  for  both  cases.  So  if  we  feel  that  the  true 
prior  is  close  to  a specific  one,  say  7T0,  we  should  model  via  one  of  the  contamination 
models,  and  achieve  a very  robust  inference. 


CHAPTER  4 

BAYESIAN  ANALYSIS  UNDER  HEAVY-TAILED  PRIORS 


4.1  Introduction 

In  this  chapter,  we  consider  the  idea  of  developing  priors  that  are  inherently  robust 
in  some  sense.  The  idea  is  that  it  is  perhaps  easier  to  build  robustness  into  the  analysis 
at  the  beginning,  than  to  attempt  verifying  robustness  at  the  end. 

Substantial  evidence  has  been  presented  to  the  effect  that  priors  with  tails  that 
are  flatter  than  those  of  the  likelihood  function  tend  to  be  fairly  robust  (e.q.,  Box 
and  Tiao  (1968,  1973),  Dawid  (1973),  O’Hagan  (1979,  1989)  and  West  (1985)).  It 
is  thus  desirable  to  develop  fairly  broad  classes  of  flat-tailed  priors  for  use  in  “stan- 
dard” Bayesian  analyses.  Andrews  and  Mallows  (1974)  and  West  (1987)  studied  scale 
mixture  of  normal  distributions  which  can  be  used  for  simulation  and  in  the  anal- 
ysis of  outlier  models.  The  Student  t family,  double-exponential,  logistic,  and  the 
exponential  power  family  can  all  be  constructed  as  scale  mixtures  of  normals.  The 
exponential  power  family  was  introduced  and  popularized  by  Box  and  Tiao  (1973) 
in  the  context  of  Bayesian  modelling  for  robustness.  Recently,  Angers  and  Berger 
(1992)  and  Angers (1992)  considered  t priors  in  the  hierarchical  Bayes  setting,  while 
Datta  and  Lahiri  (1994)  considered  general  scale  mixtures  of  normals  primarily  with 
the  end  of  outlier  detection  in  the  context  of  small  area  estimation. 

The  price  to  be  paid  for  utilization  of  inherently  robust  procedures  is  computa- 
tional; closed  form  calculation  is  no  longer  possible.  Recently,  however,  the  Markov 
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chain  Monte  Carlo  integration  techniques,  in  particular  the  Gibbs  sampling  (Geman 
and  Geman  (1984),  Gelfand  and  Smith  (1990),  and  Gelfand  et  al.  (1990))  has  proved 
to  be  a simple  yet  powerful  tool  for  performing  robust  Bayes  computations. 

Ericson  (1969)  considered  the  superpopulation  model  y{  = 9 4-  e*,  where  9,  eu  ■ ■ ■ , eN 
are  independently  distributed  with  9 ~ N(/j,0,Tq)  and  ej’s  are  iid  N(0,  a2).  As  we 
have  seen  in  Chapter  2,  under  the  N(h0,Tq)  prior,  the  Bayes  estimator  of  7 (y)  = 
N-1  Eila  ft  is 


<y0(5»  y(s))  = fy(s ) + (!-/){(!-  Bo)y(s)  + B0/io}.  (4.1.1) 

Recall  that  / = n/iV,  Mo  = ^/tq  and  Bq  = Mq/{Mq  + n).  For  unknown  a2,  a 
normal-gamma  prior  was  used. 

The  purpose  of  this  chapter  is  to  develop  inherently  robust  Bayes  procedures  to 
overcome  the  problem  associated  with  outliers  in  the  context  of  finite  population 
sampling.  We  consider  a refinement  based  on  heavy-tailed  prior  distributions  on  9 
using  scale  mixtures  of  normals  for  both  known  and  unknown  a2.  We  use  the  same 
notations  as  in  Chapter  2. 

The  outline  of  the  remaining  sections  is  as  follows.  In  Section  4.2,  we  provide 
the  robust  Bayes  estimators  of  7 (y)  based  on  heavy-tailed  prior  distributions  using 
scale  mixtures  of  normals  when  a2  is  known.  Also,  the  asymptotic  optimality  (A.O.) 
in  the  sense  of  Robbins  (1955)  of  proposed  robust  Bayes  estimators  is  proved.  The 
above  program  is  repeated  in  Section  4.3  with  the  exception  that  a2  is  unknown. 
Once  again,  robust  Bayes  estimators  are  proposed  using  scale  mixtures  of  normals, 
and  their  A.O.  property  is  studied.  Finally  in  Section  4.4,  a numerical  example  is 
provided  to  illustrate  the  results  of  the  preceding  sections. 
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4.2  Known  Variance 

Consider  the  case  when  (i)  y{  | 0 ~ N(9,a2)  (i=l,  N)  and  (ii)  6 ~ -A-p(£^*a) 

where  p(x)  — /0°°  A2  0(:rA2)<7(A)  dX,  that  is,  p(-)  is  a scale  mixture  of  the  normal 
distribution  with  mixing  distribution  g(-).  Note  that  we  can  write  (ii)  in  the  following 
two  steps;  (iia)  6 | A ~ iV(//0,  A-1)  and  (iib)  A ~ r02c/(r02 A)  where  /0°°  g(x)dx  = 1.  The 
following  list  identifies  the  necessary  functional  form  for  g{ A)  to  obtain  a wide  range 
of  densities  which  represent  departures  from  normality: 

t-priors:  If  kX  ~ xl  then  $ is  Student  t with  k degrees  of  freedom,  location 
parameter  p0,  and  scale  parameter  r0. 

double-exponential  priors:  If  1/A  has  exponential  distribution  with  mean  2 then 
0 is  double-exponential  with  location  parameter  go  and  scale  parameter  To. 

exponential  power  family  priors:  If  A has  positive  stable  distribution  with  index 
a/2  then  6 has  exponential  power  distribution  with  location  parameter  p0  and  scale 
parameter  to- 

logistic  priors:  If  \/A  has  the  asymptotic  Kolmogorov  distance  distribution  then  6 
is  logistic  with  location  parameter  p0  and  scale  parameter  r0.  [A  random  variable  Z 
is  said  to  have  an  asymptotic  Kolmogorov  distance  distribution  if  it  has  a pdf  of  the 
form  f(z)  = 8zE°°=1(-ir1i2exp(-2iV)/(0iOo)(z)  ]. 

We  shall  use  the  notations  y(s)  = n~l  EstsVi  and  y(s)  = {yi  : i & s},  the  suffixes 
in  y(s)  being  arranged  in  the  ascending  order.  Then  the  posterior  distribution  of  y(s ) 
given  by  s and  y(.s)  is  obtained  as  follows: 

(i)  conditional  on  .s,  y(s)  and  A,  y(s)  has  N((B(X)p0  + (1  - B(X))y(s))lN_n, 

° {I N—n  + Ijv-nlAf-n)))  where  B( A)  = Aa2/(Acr2  -f-  n); 

(ii)  the  conditional  distribution  of  A given  s and  y(s)  has  pdf 

f(\\s,y(s))  oc  (a2  + nA_1)“1/2exp[-|^/i— pjy]  «(t02A). 


(4.2.1) 
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Note  that  under  the  posterior  distribution  given  in  (4.2.1),  the  Bayes  estimator  of 
7 (y)  is  given  by 

ssm(s,v(s))  = E[y(v)\s,y(s)] 

= fy{s)  + (1  - f){E[B( A)  | s,  y(s)]  /x0  + (1  - E[B( A)  | s,  y(s)])  y(s)}.  (4.2.2) 
Also,  one  gets 

^(7 (y)  I s,y(s)) 

= E[V  (7  (y)  | s,  1/(5),  A)  | s,  y(.s)]  + F[  E(-y(y)  | s,  y(s),  A)  | s,  y(s)] 

= AT-2<72{(JV  — n)  4-  (AT  — n)2i?((Acr2  + n)_1  | s,  y(s))} 

+(1  ~ f)2V(B( A)  //o  + (1  - 5(A))  y(s)  | s,y(s)).  (4.2.3) 

The  calculations  in  (4.2.2)  and  (4.2.3)  can  be  performed  using  one-dimensional 
numerical  integration.  Alternately,  one  can  use  Monte  Carlo  numerical  integration 
techniques  to  generate  the  posterior  distribution  and  the  associated  means  and  vari- 
ances. More  specifically,  in  this  chapter,  we  use  Gibbs  sampling  originally  introduced 
in  Geman  and  Geman  (1984),  and  more  recently  popularized  by  Gelfatid  and  Smith 
(1990)  and  Gelfand  et  al.  (1990).  Gibbs  sampling  is  described  below. 

Gibbs  sampling  is  a Markovian  updating  scheme.  Given  an  arbitrary  starting 
set  of  values  , • • • , , we  draw  ~ \U\  \ U^,-  • • , E/^],  ~ [U2  \ 

• ,1/^  ~ [Uk  | U^\-  • • , U^],  where  [•  | •]  denotes  the  rel- 
evant conditional  distributions.  Thus,  each  variable  is  visited  in  the  natural  order 
and  a cycle  in  this  scheme  requires  k random  variate  generations.  After  t such  it- 
erations, one  arrives  at  (U}*\  ■ • - , U®).  As  t-t  00,  (U? , • • • , ujp)  -4  ■ ■ ■ ,Uk). 

Gibbs  sampling  through  q replications  of  the  aforementioned  t-iterations  generates  q 

k-tuples  (Uy  , • • • , Ukj)  (j  = 1,-  • -,q)  for  t large  enough.  Ui,---,Uk  could  possibly  be 
vectors  in  the  above  scheme. 
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Gelman  and  Rubin  (1992)  adapt  multiple  sequences,  with  starting  points  drawn 
from  an  overdispersed  distribution,  to  monitor  the  convergence  of  the  Gibbs  sampler. 
Specifically,  m > 2 independent  sequences  are  generated,  each  of  length  2d.  But  to 
diminish  the  effect  of  the  starting  distribution,  the  first  d iterations  of  each  sequence 
are  discarded.  Hence,  we  have  m x d simulated  values  for  each  parameter  of  interest. 
Using  Gibbs  sampling,  the  posterior  distribution  of  y(s ) is  approximated  by 

m d 

(md)~l  H [l/(«)  I v(s),  0 = %,  A = Ay] . (4.2.4) 

t=i  j=i 

To  estimate  the  posterior  moments,  we  use  Bao-Blackwellized  estimates  as  in  Gelfand 
and  Smith  (1991).  Note  that  E[y(y)  | s,  y(s)]  is  approximated  by 

m d 

ft M + ( W)  ( md )-’  •£  Y.  (S(A«)  to  + (1  - B( Ay))  y(«)) . (4.2.5) 

i= 1 j=l 

Next  one  approximates  V(y (y)  \ s,y(s))  by 

m d 

N~2a2{(N  — n)  + (N  — n)2  (i md)~l  ^ ^(A y-a2  + n)-1} 

i=l  j=  1 

to  d 

+(1  - /)2[(mrf)-1  Y Y(B(^M  + (1  - B(A«))S(»))2 

1=1  j= 1 

to  d 

-{H-'EEW*  + (1  - B(A«))f(s)))2].  (4.2.6) 

i=l  1=1 

The  Gibbs  sampling  analysis  is  based  on  the  following  posterior  distributions  : 

(i)  9 | s,  y(s),  |/(S),  A ~ N [(A//0  + £t=:  Vi/a2)/( A + N/a 2),  (A  + fV/a2)-1] ; 

00  /(A  | 5,?/(s),i/(s),6>)  oc  \/Aexp[-|(6»  - Mo)2]  m(t02A); 

(iii)  y(5)  | s,  3/(s),0,  A ~ fV[01,v-„,  a2/N_n], 

Note  that  if  /cA  ~ xl  then  /(A  | s,y(s),y(s),9)  reduces  to  a Gamma(|{rQA:  + 
(0  - /i0)2},  + 1))  density.  [A  random  variable  W is  said  to  have  a Gamma(a,  /3) 
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distribution  if  it  has  a pdf  of  the  form  f(w)  oc  exp {-aw)wp~l  J(0,oo)(w),  where  / 
denotes  the  usual  indicator  function].  Also,  if  1/A  has  exponential  distribution  with 
mean  2 then  /(A  | s,y(s),y(s),$)  reduces  to  a IGN(l/-y/ro2(0  - /x0 ) 2 , 1 /r02)  density. 
[A  random  variable  V is  said  to  have  a IGN (771 , 772)  distribution  if  it  has  a pdf  of  the 


We  shall  now  evaluate  the  performance  of  the  robust  Bayes  estimator  8lSM  of  9 for 
large  n under  the  N(/iq,Tq)  prior,  say  7r0.  The  Bayes  estimator  of  9 under  this  prior 
is  £°  which  is  given  by  (4.1.1).  Let  r(7To,5)  denote  the  Bayes  risk  of  an  estimator  S of 
9 under  the  prior  n0.  Our  aim  is  to  show  that  r(7T0,  SgM)  - r(7r0, 5°)  ->  0 as  n ->  00. 

Lemma  4.2.1  Assume  E( A3/2)  < 00.  Then  E[B{ A)  | s,y(s)]  4 0 as  n ->  00. 

Proof  of  Lemma  4.2.1  Note  that 


E[B(X)  \ s,y(s)]  = a1  f°° (a2  + nA-1)-3/2  exp[ 

J 0 


1/2 


E 


A1/2 i 

1 + n(a2)_1A“1 


1/2 


(4.2.7) 
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Now,  Pn<n  1<j2E(A3/2)  4 0,  if  E( A3/2)  < cm.  Also, 

Qn  = E[(n~xa2 A + l)’1/2  A1/2  exp(-^((y(a)  - /r0)2)].  (4.2.8) 

Note  that  j/(.s)  — /x0  is  the  centered  mean  of  an  exchangeable  sequence  of  random 
variables,  and  hence,  is  a centered  backward  martingale.  Hence,  ( y(s ) — /x0)2  is 
a backward  submartingale.  Since  limsup E(y(s)  - /x0)2  = r02  < +oo,  by  the 
submartingale  convergence  theorem,  ( y(s ) — //0)2  converges  a.s.  to  a rv,  say  Yq. 
Hence,  using  Fatou’s  lemma 


!im  inf  Qn  > E[ A1/2  exp(-^y0)],  (4.2.9) 

where  the  lower  bound  is  bounded  away  from  zero  a.s..  Hence,  Pn/Qn  4 0 as  n — 4 oo. 

We  now  turn  to  the  theorem  which  proves  the  A.O.  property  of  SXSM  obtained  in 
(4.2.2). 


Theorem  4.2.1  Assume  E( A3/2)  < oo.  Then  r( n0,SxSM)  - r(7r0,5°)  0 as  n -4  oo. 

Proof  of  Theorem  4.2.1  Standard  Bayesian  calculations  yield 

r(*o ,4m)  - r(7r0)£°)  = £(<^M  - 60)2 
= (1  - f)2E[(E(B( A)  | a,y(a))  - B0)2  ($(*)  - /k>)2].  (4.2.10) 

By  Lemma  4.2.1,  E[B( A)  | s,y(s)]  4 0 as  n — > oo.  Also,  Bo  -4  0 as  n — 4 oo.  Hence, 

(E{B( A)  | s,y(s))  — Bo)2  -4  0 as  n — > oo.  Also,  |B(B(A)  | s, y(s))  — Bo|  < 1 and 
( y{s ) ~ H o)2  being  a backward  submartingale  is  uniformly  integrable.  Hence,  the  rhs 
of  (4.2.10)  -4  0 as  n — > oo.  This  completes  the  proof  of  the  theorem. 
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4.3  Unknown  Variance 


In  this  section,  somewhat  more  realistically,  we  consider  the  normal  superpopu- 
lation model  with  unknown  mean  6 and  unknown  variance  r-1.  Ericson  (1969)  used 
a normal-gamma  prior  on  (0,r)  in  this  setting.  That  is,  9 \ r ~ iV(/x0,  r_1r02)  and 
r ~ Gamma(|ao,  |<7o)-  But  in  this  case  the  ratio  of  model  variance  and  prior  variance 
is  known. 

Suppose  now,  more  generally,  that  yt  \ 9,  r ~ N(9,  r"1)  (i  = 1,  • • • , N),  and  6 and 
r are  independently  distributed  with  6 ~ AT(/i0,r02)  and  r ~ Gamma(|a0,  \gQ).  Then 
the  posterior  distribution  of  y{s)  given  s and  y(s)  is  obtained  via  the  following  two 
steps  : 

(i)  conditional  on  s,y(s)  and  r,  y(s)  has  N((B(r)n0  + (1  - B{r))y(s))lN_n, 
r~l{lN-n+  M(r)+n1N-nlN-n))i  where  M(r)  = r~l /tq  and  B(r)  = M(r)/(M(r)  + n). 

(ii)  the  conditional  distribution  of  r given  s and  y(s)  has  pdf 


f(r\s,y(s)) 


oc 


r(n+go~2)/2 
(1+nTg  r)1/2 


exp[-|{a0  + E iesiVi  ~ y{s))2  + 


(4.3.1) 


Note  that  under  the  posterior  distribution  given  (4.3.1),  the  Bayes  estimator  of 
7 (y)  is  given  by 


= E[y{y)  I s,y(s)] 

= fy(s)  + (!  - f){E[B{r)  I s, y(s)]fx 0 + (1  - E[B(r)  | s, y(«)])y(«)}.  (4.3.2) 
Also,  one  gets 


v(l(v)  I »■!/(»)) 

= E[  V(y (y)  I s,  y(s ),  A,  r)  | s , y(s)]  + V[  E('y(y)  \ s,  y(s),  A,  r)  | s,  y(s)] 

= N~*(N  - n)B[r->{l  + 7*(1  - B(r)}  | »,»(»)] 

+(1  - /)2(»M  - go)2V(B(r)  | s,  y(s)).  (4.3.3) 
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In  order  to  robustify  the  above  model,  consider  the  case  when  (i)  y{  \ $,r  ~ 
N(9,r~l)  (i  = (ii)  r ~ Gamma(Ia0i  |flo)  and  (iii)  6 ~ ^(^p-),  where 

p(x)  = /o°°  A2  0(zA2)^(A)dA.  Note  that  the  prior  pdf  of  6 does  not  depend  on  r. 
Recall  that  we  can  write  (iii)  in  the  following  two  steps;  (iiia)  9 | A ~ N(n o,  A-1)  and 
(iiibj  A ~ t02#(t02 A)  where  f0°°  g(x)dx  = 1. 

Then  the  posterior  distribution  of  y(s)  given  by  s and  y(s)  is  obtained  as  follows  : 

(i)  conditional  on  s,  y(s ),  r,  and  A,  y(s)  has  N((B(X,  r)/i0  + (l  -B(A,  r))y(s))ljv-n, 
r_1(Jw-n  + ^A_liv-nlw-n))>  where  #(A,  r)  = 1/(1  + nA_1r); 

(ii)  the  conditional  distribution  of  A and  r given  s and  y(s ) has  pdf 

f(X,r\s,y(s)) 

K exp[-|{oo  + E*,fo  - »(S))2  + gtfX).  (4.3.4) 

Note  that  under  the  posterior  distribution  given  in  (4.3.4),  the  Bayes  estimator  of 
7 (y)  is  given  by 

<*sm(s>I/(s))  = E[y(y)\s,y(s)} 

= Ms)  + (1  - f){E[B( A,  r)  | s, y{s)]p 0 + (1  - E[B(A,  r)  | s, y(s)])y(s)}{4.3.5) 
Also,  one  gets 

V{rt (y)  I s,y(s)) 

= E[V  (y(y)  | s,  y(s),  A,  r)  | s,  y(s)]  + V[  £(7(1/)  | s,  y(s ),  A,  r)  | s,  y(s)] 

= N-\N  - n)£J[r-Hl  + (iV  - n)^}  | a,  y(a)] 

+(1  - f)2  V (B( A,  r)  /*o  + (1  - B(A,  r))  y(s)  \ s,  y(s)).  (4.3.6) 

Using  Gibbs  sampling,  the  posterior  distribution  of  y(s)  is  approximated  by 

m d 

( md )_1  ^ [y(a)  | a,  y(a),  6>  = %,  A = Ay,  r = ry] . 

«=i  i=i 


(4.3.7) 
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To  estimate  the  posterior  moments,  we  use  once  again  the  Rao-Blackwellized  esti- 
mates as  in  Gelfand  and  Smith  (1991).  Note  that  E[y(y)  | s,y(s)]  is  approximated 
by 


m d 

fy(s ) + (!-/)  ( md)~l  Y yu0  + (1  - B(Xij,rij))  y(s )) . (4.3.8) 

t=i  j=  i 

Next  one  approximates  1/(7 (y)  | s,y(s ))  by 

m d 

N~2{N  - n)M_1  £ E[(rii)_1{l  + (^  - n)(Ali(rii)-1  + n)-1}] 

<=1  j= 1 

m d 

+(1  - f)2[(md)-'  Y,  £(«(-* + (1  - SfAj^Ms))2 
t=l  j=l 

m d 

-{( md y1  Y E(£(Ab>  ry)/io  + (1  - B(Aij,  ry))y(«))}2].  (4.3.9) 

i=l  j=l 

The  Gibbs  sampling  analysis  is  based  on  the  following  posterior  distributions  : 

(i)  9 I a,  y(s),  y(s),  X,r  ~ N [(A//0  + r Ejli  2/*)/(A  + riV),  (A  + rN)~l] ; 

(ii)  /(A  | s,y(s),y(s),9,r)  oc  v/Aexp[--|(6>  - /i0)2]  5(t02A); 

(iii)  r | a,y(s),y(a),A,0~  Gamma(i{oo  + E£i(l/<- + 

(iv)  2/(s)  | s,  y(s),  #,  A,  r ~ N[0lN_n,  r-1/jv_n]. 

Recall  that  if  /cA  ~ xl  then  /(A  | s,y(s),y(s),9)  reduces  to  a Gamma(|{roA:  + 
(9  — £fo)2},  |(&  + 1))  density.  Also,  if  1/A  has  exponential  distribution  with  mean  2 
then  /(A  | s,y(s),y(s),9)  is  reduced  to  a IGN(1  /\Jr$(9  - n0)2,  1/tq)  density. 

Now  to  evaluate  the  performance  of  the  robust  Bayes  estimator  5|M  of  0 for  large 
n,  we  denote  £ as  the  prior  under  which  9 and  r are  independent  with  9 rsj  N(h0,t2) 
and  r ~ Gamma(|a0,  |<7o)-  The  Bayes  estimator  of  9 under  prior  £ is  given  by  (4.3.2). 
Our  goal  is  now  to  show  that  r(f , 52SM)  - r(£,  SB)  -»  0 as  n -4  00. 
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Lemma  4.3.1  Assume  E( A3/2)  < oo.  Then  E[B{r)  | s,  y(s)]  4 0 as  n ->  oo.  Also, 
E[B(X,r)  | s,y(s)]  4 0 as  n -4  oo. 


Proof  of  Lemma  4.3.1  First  show  J5'[B(r)  | s,  y(s)]  4 0 as  n — > oo.  This  amounts  to 
showing  Nn/Dn  4 0 as  n — > oo,  where 


= / 
./0 


00  r(«+So-2)/2 

o (1  + titq  r)3/2 


exP[-F{°o  + E(^- ^(s))2  + 


iGs 


4?/(g)  - aq)s 

1 + titq  r 


}]dr  (4.3.10) 


and 


/•oo  r(n+9o-2)/2  n(fi(A  — uA2 

Dn  = l (1  + nr2r)i/2  exP[~2^a°  + T,(Vi  - ^(g))2  + - \ }]^r-  (4-3.11) 


iGs 


1 + utq  r 


Note  that 


Dr, 


roo  r(n+9o-2)/2  /.-j/'o’v  _ ..  \2 

-1  + (4-3.12) 


rG  s 


/o  (1+nrgr)1/2 

Hence,  defining  Wn  ~Gamma(4{a0  + Zi&iVi  ~ V(s))2},  \{n  + gQ-  1)), 


Nn 

-4  < E 
Dn  ~ 


7 ' 

42  1 

V1  + nr02W„y 

) 1 + nr2Wn 

-E 


Wn 


1/2' 


V1  +m$Wnt 


exp 


(y(s)  - /x0); 
2 r02 


< E 


nr2Wn  \1/2  1 


(l  + nr02Wn  y/  1 4-  nr^HL 


4-£ 


nroWn  \1/2' 


1 + nr02Wn/ 


exp  - 


jy(s)  - po)2 

2 Tn2 


£ (**!/)• 
tin 


(4.3.13) 


69 


Now,  An  < E[(l  + nrlWn)'1}  < n-\T%)-xE(W-x).  Note  that 


n 


-i  E{W~X)  = n~x  a°  ^(,s))  g 

n -\-  go  ~ 3 


(4.3.14) 


since  by  the  law  of  large  numbers  for  exchangeable  sequences,  4 R~x. 

Again, 


Bn  = E[(n  1(tq)  1 Wn  1 + 1 ) 1/2]  exp  | — 
> {n~l [rl)~x E(W~l)  + l)-1/2  exp  f- 


(y(s)-g  o)2^ 
2r02  ) 

{y{s)  - mo)2' 


2r,? 


using  Jensen’s  inequality  since  (a;  + 1)  */2  is  a convex  function  of  x.  Hence, 


(4.3.15) 


N, 


< n-x(r^-xE(W-x)(n-x(^)-xE(W-x)  + l^exp 


'{y{s)  - Mo)2  A ^ Q 


2r02 


(4.3.16) 


since  (y{s)  - Mo)2  = Op{  1)  and  n~1E(W~1)  ^ 0. 

Next  we  need  to  show  that  E[B( A,  r)  | s,  y(s)]  4 0 as  n -4  oo,  that  is,  N'n/D'n  4 0 


as  n — > oo,  where 


oo  roc 


r.(n+po-2)/2 


n(p(s)  - Mo)5 


*“/.  Jo  *«a)(IT^FT 


(4.3.17) 


and 


/•oo  /-oo  r(n+go— 2)/2 

d:  = / / r 


0 ao 


(T+  nA~1r)1/2  ^Ph^^+Efa-gW)^5^^}]^- 


(4.3.18) 


Note  that 


K> 


to9(to  A) 


r(n+go-2)/2 

(1  + nA-1?-)1/2 


exP[-o{«o+E(^-?/(s))2}- 


A(?/(s)  - Mo)5 


■]drdA. 


(4.3.19) 


70 


Hence,  defining  Wn  as  before, 


N' 

TV  * E 


Wn 


1/2 


1 


-i-E 


1 + n\-xWn)  1 + nX~lWn 

(i+„Tv) 7 exp  ^ 


< E 


nX~lWn 


1/2 


A1/2. 


1 


l+n\-lWnJ  1 + n\~lWn 


±E 


nX~lWn 
1 +nA-1H7l 


1/2 


A1/2exp  f-^(?/(s)  - K>)2 


^7  (say)> 


(4.3.20) 


where  expectation  is  taken  with  respect  to  the  joint  pdf 


/(A, w)  = ro#(ro  A)  exp[-^{a0  + £(&•  - t/(s))2}] 

i£s 


X 


. n+flfl-l 

w(n+g0-\)/2-l  ( Oo+^i£i(y^-S(s))2  ^ 2 


r(n+g0-l  ^ y 2 


(4.3.21) 


Now, 


A'n  < E[Xll2n-xXW-1}  = n-lE{X^2)E{W~x) 

= n~xE( A3/2)  °°+^g(oyi3y('))2  ^ 0,  (4.3.22) 

if  £(A3/2)  < oo.  In  the  above,  we  have  used  independence  of  Wn  and  A.  Again, 

K = E{(n~'XW^  + I)"'/2 W2  exp(-i(y(S)  - p0)2)] 

= + (4.3.23) 


71 


Since  (x  + 1)  :/2  is  a convex  function  of  x,  using  Jensen’s  inequality  for  conditional 
expectations, 

EKn-'AW-1  + I)'1/2  | A]  > [E(n-lXW-1  | A)  + 1 ]-V2 

= (n^XEiW-1)  + l)"1/2.  (4.3.24) 

Hence, 


= mAfl0  E EiesiVi  - y{s))2 
n n + go  — 3 


+ !)  V2aV2  exp(— ^(y(s)  - //0)2)] 


(4.3.25) 


By  the  same  argument  in  Lemma  4.2.1,  ( y(s ) — //0)2  converges  a.s.  to  a rv,  say  F0'. 
Hence,  using  Fatou’s  lemma,  it  follows  from  (4.3.25)  that 


Un™ Sf  K > £[A1/2exp(~yo')]. 


(4.3.26) 


Hence,  B'n  is  bounded  away  from  zero  a.s.  in  the  limit.  Hence  A'n/B'n  A 0 so  that 

K/K  4 o. 


We  now  turn  to  the  theorem  which  proves  the  A.O.  property  of  52SM  obtained  in 
(4.3.5). 


Theorem  4.3.1  Assume  E( A3/2)  < oo.  Then  r(£,5fM)  - r(£,8B)  ->  0 as  n — > oo. 
Proof  of  Theorem  4.3.1  Standard  Bayesian  calculations  yield 
»■(*,  %m)  ~ = E(S2sm  - SB)2 

= (1  - f)2E[{E[B( A,  r)  | s,  y(s)]  - E[B{r)  | s,  y(s)]}2  (y(s)  - yu0)2]-  (4.3.27) 
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By  Lemma  4.3.1,  E[B(\,r)  | s,r/(s)]  4 0 as  n oo  and  E[B(r)  \ s,y(s)]  4 0 as 

n ->  oo.  Hence,  {£[B(A,r)  | s,y(s)]  - E[B(r)  | s,y(s)]}2  4 0 as  n ->  oo.  Also, 
|E[5(A,r)  | s,  y(s)]  — E[B(r)  \ s,  r/(s)]|  < 1 and  (y(s)  — p0)2  is  uniformly  integrable. 
Hence,  the  rhs  of  (4.3.27)  -»  0 as  n — > oo.  This  completes  the  proof  of  the  theorem. 

4.4  An  Example 

We  illustrate  the  methods  of  Section  4.2  and  4.3  with  an  analysis  of  data  in 
Cochran  (1977).  The  data  set  consists  of  the  1920  and  1930  number  of  inhabitants, 
in  thousands,  of  64  large  cities  in  the  United  States.  The  data  were  obtained  by  taking 
the  cities  which  ranked  fifth  to  sixty-eighth  in  the  United  States  in  total  number  of 
inhabitants  in  1920.  The  cities  are  arranged  in  two  strata,  the  first  containing  the  16 
largest  cities  and  the  second  the  remaining  48  cities.  But  for  our  purpose,  we  just  use 
the  second  stratum  only.  For  the  complete  population,  we  find  the  population  mean 
to  be  197.875  and  the  population  variance  5580.92.  We  use  the  1920  data  to  elicit 
the  prior  in  our  setting  so  that  /x0  = 165.438  and  r02  = 71.424.  We  want  to  estimate 
the  average  (or  total)  number  of  inhabitants  in  all  48  cities  in  1930  based  on  a sample 
of  size  16  (i.e.  1/3  sample).  For  illustrative  purposes,  we  have  decided  to  report  our 
analysis  for  one  sample. 

In  deriving  the  robust  Bayes  estimates  based  on  heavy-tailed  prior  distributions 
using  scale  mixtures  of  normals,  we  have  considered  Gibbs  sampler  with  10  indepen- 
dent sequences,  each  with  a sample  of  size  5000  with  a burn  in  sample  of  another 
5000'. 

Table  4.1  provides  the  Bayes  estimates  of  7 (y)  and  the  associated  posterior  stan- 
dard deviations  for  the  normal,  double  exponential  and  t priors  with  degree  of  freedom 
1,  3,  5,  10  and  15  in  both  known  and  unknown  a2  cases.  For  the  unknown  a2  case,  we 
have  used  a0  = g0  = 0 to  ensure  some  form  of  diffuse  gamma  prior  for  the  inverse  of 
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the  variance  component  in  our  superpopulation  model.  Note  that  the  naive  estimate, 
that  is,  the  sample  mean  is  207.69. 

An  inspection  of  Table  4.1  reveals  that  there  can  be  significant  improvement  in 
the  estimate  of  7 (y)  by  using  heavy-tailed  prior  distributions  rather  than  using  the 
normal  prior  distribution  in  the  sense  of  the  closeness  to  7 (y).  For  instance,  using 
the  double  exponential  and  the  t(l),  t(3),  t (5) , t(10)  and  t (1 5)  priors,  the  percentage 
improvements  over  the  normal  are  given  respectively  by  45.78%,  89.05%,  52.06%, 
30.68%,  15.53%  and  9.06%  for  the  known  a 2 case.  Here  the  percentage  improvement 
of  ei  over  e2  is  calculated  by 


((e2  - truth)2  - ( e-i  - truth)2)/(e2  - truth)5 


where  e\  is  the  robust  Bayes  predictor  based  on  heavy-tailed  prior  distributions  and 
e2  is  the  Bayes  predictor  using  the  normal  prior.  Also  as  one  might  expect,  flatter  the 
prior,  closer  is  the  Bayes  estimates  to  the  sample  mean.  In  general,  for  most  cases  we 
have  considered,  the  Cauchy  prior  (i.e.,  t prior  with  1 degree  of  freedom)  leads  to  an 
estimate  which  is  closest  to  the  population  mean. 

We  adopt  the  basic  approach  of  Gelman  and  Rubin  (1992)  to  monitor  the  conver- 
gence of  the  Gibbs  sampler.  For  6,  we  simulate  m = 10  independent  sequences  each 
of  length  2d  = 10000  with  starting  points  drawn  from  a t distribution  with  2 degree 
of  freedom.  The  justification  for  t distributions  as  well  as  the  choice  of  the  specific 
parameters  of  this  distribution  are  given  below. 

First  note  that  from  the  posterior  distribution  of  A given  s and  y(s)  as  given  in 
(4.2.1),  we  find  the  posterior  mode,  say  A by  using  the  Newton-Raphson  algorithm. 
Also,  we  use  y(s)  for  £ s based  on  sample.  We  can  now  very  well  use  the 
N [(A/x0  + Ny(s)/a2)/( A + N/a2),  (A  + N/a2)-1  as  the  starting  posterior  distribu- 
tion for  0.  But  in  order  to  start  with  an  overdispersed  distribution  as  recommended 
by  Gelman  and  Rubin,  we  take  t distribution  with  2 degree  of  freedom.  Also,  note 
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that  once  the  initial  6 value  have  been  generated,  the  rest  of  the  procedure  uses  the 

posterior  distributions  as  given  in  (i)-(iii)  in  Section  4.2.  Similar  procedures  can  be 

used  for  the  unknown  a2  case. 

Next,  as  in  Gelman  and  Rubin,  we  compute 

B/5000=the  variance  between  the  10  sequence  means  0j.  each  based  on  5000  6 values, 

10  _io 

that  is  B/5000  = ^T(0j.  — 6..)2  /(10  — 1),  where  6..  = 9t.. 

»=1  i= 1 

W—  the  average  of  the  10  within-sequence  variance  s2  each  based  on  (5000-1)  degrees 

10 

of  freedom,  that  is  W = ^ ^ si  • 

i=l 

Then,  find 

„ o 5000-  1 1 

a2  — W B 

5000  5000 

and 

V ~ a + (io)(5000)  B' 

Finally,  find  R = V/W.  If  R is  near  1 for  all  scalar  estimands  of  interest,  it  is 
reasonable  to  assume  that  the  desired  convergence  is  achieved  in  the  Gibbs  sampling 
algorithm  (see  Gelman  and  Rubin  (1992)  for  the  complete  discussion). 

The  second  column  of  Table  4.2  provides  the  R values  (the  potential  scale  reduc- 
tion factors)  corresponding  to  the  estimand  6 using  Cauchy  and  double  exponential 
priors  based  on  10  x 5000  = 50000  simulated  values.  The  third  column  provides  the 
corresponding  97.5%  quantiles  which  are  also  equal  to  1.  The  rightmost  five  columns 
of  Table  4.2  show  the  simulated  quantiles  of  the  target  posterior  distribution  of  9 for 
each  one  of  the  4 estimates  based  on  10  x 5000  = 50000  simulated  values. 
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Table  4.1.  Bayes  Estimates  and  Associated  Posterior  Standard  Deviations 


Priors 

Known 

a2 

Unknown  a2 

Bayes  Estimates 

Posterior  SD 

Bayes  Estimate 

Posterior  SD 

Normal 

184.31 

10.19 

183.70 

11.47 

DE 

187.89 

12.16 

187.09 

13.14 

t(l) 

193.38 

15.01 

192.53 

15.79 

t(3) 

188.48 

12.63 

187.69 

13.60 

t(5) 

186.58 

11.51 

185.93 

12.60 

t(10) 

185.41 

10.75 

184.68 

11.91 

t(15) 

184.94 

10.50 

184.30 

11.75 

Table  4.2.  Potential  Scale  Reduction  and  Simulated  Quantiles 


Potential  scale 
reduction 

Simulated  quantiles 

Priors 

R 

97.5% 

2.5% 

25.0% 

50.0% 

75.0% 

97.5% 

Known  a2 

Cauchy 

1.00 

1.00 

160.00 

171.98 

183.09 

198.17 

226.89 

DE 

1.00 

1.00 

158.68 

168.54 

176.00 

185.55 

206.82 

Unknown  a2 

Cauchy 

1.00 

1.00 

157.54 

170.38 

181.46 

196.96 

226.55 

DE 

1.00 

1.00 

156.63 

167.27 

174.63 

184.55 

206.39 

CHAPTER  5 

BAYESIAN  ROBUSTNESS  IN  SMALL  AREA  ESTIMATION 


5.1  Introduction 


Small  area  estimation  is  becoming  important  in  survey  sampling  due  to  a growing 
demand  for  reliable  small  area  statistics  from  both  public  and  private  sectors.  In 
typical  small  area  estimation  problems,  there  exist  a large  number  of  small  areas, 
but  samples  available  from  an  individual  area  are  not  usually  adequate  to  achieve 
accuracy  at  a specified  level.  The  reason  behind  this  is  that  the  original  survey  was 
designed  to  provide  specific  accuracy  at  a much  higher  level  of  aggregation  than  that 
for  small  areas.  This  makes  it  a necessity  to  “borrow  strength”  from  related  areas 
through  implicit  or  explicit  models  that  connect  the  small  areas  to  find  more  accurate 
estimates  for  a given  area,  or  simultaneously,  for  several  areas.  Ghosh  and  Rao  (1994) 
have  recently  surveyed  an  early  history  as  well  as  the  recent  developments  on  small 
area  estimation. 

Like  frequentist  methods,  Bayesian  methods  have  also  been  applied  very  exten- 
sively for  solving  small  area  estimation  problems.  Particularly  effective  in  this  regard 
has  been  the  hierarchical  or  empirical  Bayes  (HB  or  EB)  approach  which  are  espe- 
cially suited  for  a systematic  connection  of  the  small  areas  through  models.  For  the 
general  discussion  of  the  EB  or  HB  methodology  in  the  small  area  estimation  context, 
we  may  refer  to  Fay  and  Herriot  (1979),  Ghosh  and  Meeden  (1986),  Ghosh  and  Lahiri 
(1987),  Datta  and  Ghosh  (1991),  Datta  and  Lahiri  (1994),  among  others. 
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In  this  chapter,  we  propose  an  alternative  Bayesian  approach,  namely  the  robust 
Bayes  (RB)  idea  which  has  been  discussed  in  the  previous  chapters  in  the  context  of 
a single  stratum.  Specifically,  the  HB  procedure  models  the  uncertainty  in  the  prior 
information  by  assigning  a single  distribution  (often  noninformative  or  improper)  to 
the  prior  parameters  (usually  called  hyperparameters).  Instead,  as  discussed  in  the 
earlier  chapters,  the  RB  procedure  attempts  to  quantify  the  subjective  information 
in  terms  of  a class  T of  prior  distributions. 

In  order  to  study  Bayesian  robustness  in  the  context  of  small  area  estimation,  we 
consider  the  following  hierarchical  Bayes  model. 

(A)  Conditional  on  0,  /3,  and  r2,  let  Yi,  - ■ • ,YP  be  independently  distributed  with 
Yt  ~ N(6i,  Vi ),  i = 1,  • • • ,p,  where  the  Vj’s  are  known  positive  constants; 

(B)  Conditional  on  /3,  and  t2,  0i,  - • • , @p  are  independently  distributed  with  0,  ~ 

N(xf(3,T%i  = 1,  - - - , p,  where  aq,  • • • , xp  are  known  regression  vectors  of  dimension 
s and  (3  is  sxl; 

(C)  (3  ~ uniform(7?s)  and  r2  is  assumed  to  be  independent  of  (3  having  a distri- 
bution h(r2)  which  belongs  to  a certain  class  of  distributions  I\ 

We  shall  use  the  notations  Y = (Yu  ■ • • , Yp)T,  6 = (6X , ■ • • , 9p)T , X = (sq , • • • , xp)T. 
Write  G - Diag{Ri,  - • • , Vp}  and  assume  rank  ( X ) = s.  Cano  (1993)  considered  a 
special  case  of  this  model  when  Xi  = 1 and  Vj  = V for  i = 1 , • • • , p. 

The  outline  of  the  remaining  sections  is  as  follows.  In  Section  5.2,  we  choose  T to 
be  e-contamination  class  of  priors  where  the  contamination  class  includes  all  unimodal 
distributions.  We  develop  the  robust  hierarchical  Bayes  estimators  of  the  small  area 
means  and  the  associated  measures  of  accuracy  (i.e.,  the  posterior  variances)  based 
on  type-II  maximum  likelihood  priors.  Also  we  provide  the  range  where  the  small 
area  means  belong  under  the  ^-contamination  class. 
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In  Section  5.3,  we  choose  T to  be  the  density  ratio  class  of  priors.  As  suggested 
by  Wasserman  and  Kadane  (1992),  we  use  Gibbs  sampling  to  compute  bounds  on 
posterior  expectations  over  the  density  ratio  class. 

In  Section  5.4,  we  choose  T to  be  the  class  of  uniform  priors  on  r2  with  r2  < r2  < 
t2-  We  are  interested  in  the  sensitivity  analysis  of  the  posterior  quantity  over  T. 

Finally,  Section  5.5  contains  the  analysis  of  the  real  data  to  illustrate  the  results 
of  the  preceding  sections. 

5.2  e-Contamination  Class 

In  this  section,  we  consider  the  class  T of  priors  of  the  form 

T — {h  : h = (l  - e)h0  + eq,  q € Q},  (5.2.1) 

where  0 < e < 1 is  given,  h0  is  the  inverse  gamma  distribution  with  pdf 

q,A>  ^ 

M^2)  = r(°oj  ^2^+1  exp(-o;o/r2)/(o,00)(r2),  (5.2.2) 

denoted  by  IG(a0i  A>)>  and  Q is  the  class  of  all  unimodal  distributions  with  the  same 
mode  Tg  as  that  of  /i0. 

The  joint  (improper)  pdf  of  Y,  0,  (3  and  r2  is  given  by 

/(y,^,/3,T2)  oc  exp[— i(y  - 6)TG~\y  - 0)\(t2)~*  exp[— ^||0  - X(3\\2] 

x {(1  - £)h0(r2) +eq(T2)}.  (5.2.3) 

Integrating  with  respect  to  (3  in  (5.2.3),  one  finds  the  joint  (improper)  pdf  of  Y,  6, 
and  r2  given  by 

/(y,0,r2) 


. oc  (r2)-^  exp[-i(y  - 0)TG~\y  - G)  - ^GT(Ip  - X(XT X)~'XT)0\ 

X{(1  ~ £)h0(T2)  + eq(T2)}.  (5.2.4) 
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Write  E 1 — G 1 + (r2)  x(Jp  — X(XT  X)  lXT).  Then,  one  can  write 

(y  - 0)TG_1(y  - 9)  + (t2)~1Bt  (I  p - X(XTX)~1XT)0 

= GTE~lG  - 26TG~ly  + yrG_1i/ 

= (0  - ,EG-1y)rJE-1(0  - EG~ly)  + yT(G~1  - G^EG^y.  (5.2.5) 
From  (5.2.4)  and  (5.2.5),  we  have 

£[0|y,r2]  - EG~xy\  U[0|y,r2]  = E (5.2.6) 

Using  (5.2.5)  and  integrating  out  with  respect  to  0 in  (5.2.4),  one  gets  the  joint 
(improper)  pdf  of  Y and  r2  given  by 

f(y,r2) 

oc  \E\12(t2)-^  exp[-±yT(G-1  - G~x EG~x)y}{(\  - e)h0(r2)  + eq(r2)} (5.2.7) 
We  denote  by  m(y\h)  the  marginal  distribution  of  y with  respect  to  prior  h,  namely 

™(y\h)  = J f(y\r2)h(dT2).  (5.2.8) 

For  h E T,  we  get 

m(y\h)  = (1  - e)m(y\hQ)  + em(y\q).  (5.2.9) 

Our  objective  is  to  choose  the  ML-II  prior  h which  maximizes  m(y\h)  over  T.  This 
amounts  to  maximization  of  h(y\q)  over  q € Q.  Using  the  representation  of  each 
qe  Q as  a mixture  of  uniform  densities,  the  ML-II  prior  is  given  by 

h(r2)  = (1  - £)h0(r2)  + eq(r2)  (5.2.10) 

where  q is  uniform  (r02,  r02  + z),  z being  the  solution  of  the  equation 

f(y\z)  = lJ°  f(y\r2)dr2, 


(5.2.11) 
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and  r02  is  the  unique  mode  of  h0(r2).  Note  that 

f{y\r2)  oc  exp[-iyr(G_1  - G~xEG~x)y\.  (5.2.12) 

Write  u{  = V{/ (Vi  + r2),  * = 1,  • • • ,p,  and  D = Diag{l  - u1;  • - • , 1 - up}.  Then,  on 
simplification,  it  follows  that 

E = t2(Ip  -D)  + t2(Ip  - D)X(XTDX)~xXT(Ip  - D);  (5.2.13) 

EG1  = D + (Ip  — D)X(XtDX)~1XtD ; (5.2.14) 

EG~  V = [(1  - Ui)yi  + •••,(!  — up)yp  + upxpP]T,  (5.2.15) 

where  3 = (XT  DX)~x(XT  Dy).  Then 

G l - G~XEGX  = (t2)~x[D  - DX(XTDX)~xXTD\.  (5.2.16) 

Hence, 

VT(G~l  - G-'EG-')y 

= (r1)'1©  - <H )vt  - (E(l  - ui)yixi)T(XTDX)-\'£(\  - «,)»,*,)] 

*=1  t=l  i=l 

= Qr^(y)  (say).  (5.2.17) 

Combining  (5.2.12),  (5.2.16)  and  (5.2.17),  we  can  write 


f(y\r2)  oc  \E\*(t2)  V exp[-^QT2(y)]. 


(5.2.18) 


Writing  F — G 1 + (r2)  xIp , and  using  Exercise  2.4,  p.  32  of  Rao  (1973),  one  gets 


lET1!  = 


F X 

XT  t2(XtX) 


h \r2(XTX)\ 

= \F\\t2(XtX)~  XtF~xX\  + \t2(XtX) 
« (r2)-4(r2  + h,)}|ITDI|. 


(5.2.19) 
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It  is  clear  from  (5.2.18)  and  (5.2.19)  that 


f{yV2)  oc  (r2)2{f[(r2  + Vi)  *}\XtDX\ 


i= 1 


'*  exp[-igTa(y)]. 


(5.2.20) 


Now  to  find  the  solution  z to  the  equation  (5.2.11),  we  consider 

zf{y\z)  = / ° + f{y\T2)dr2.  (5.2.21) 

Jt0 

By  differentiating  both  sides  with  respect  to  z,  we  get 

f{y\z)  + z^-J{y\z)  = /(y|r02  + z).  (5.2.22) 

By  Lemma  A. 4. 4.,  Lemma  A. 4. 5.  and  Theorem  A. 4. 6 in  Anderson  (1984),  recall  that 
for  the  s x s matrix  A = (ay), 


and 

j-z\ 4=EE^,  (5.2.24) 

where  Ay  = ^-|A|  is  the  cofactor  of  ay.  Write  A — Y%= i ^vixixI-  Then  after  some 
calculations  using  (5.2.23)  and  (5.2.24),  it  can  be  shown  that 


dz 


fiy\z) 


= 1f[(2:  + vi)  * I H ^ exp{— 


i=l 


t=l  Z + Vi 


t=i 


t=i 


f=l  Z + Vi 


XiX { \ * exp{-— H(z)} 


~ oz*n(z  + vi)  *<r(A  ^Jexpf-^tf^)} 


2—1 


2z 
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+ 


2fn (Z  + Vt)  l2\J2—^T7Xix'[\  2 exp{ 

i= 1 t=l  z vi 


1 

2z 


J_  d_ 
2z  dz 


H{z)\ 


(5.2.25) 


where 


"w  - £ tt^.2  - <g  d^^x:  T^-n-tE 


z 

Tl  z + Vi' 


i= i 2 + ^ 

Using  (5.2.25),  (5.2.22)  leads  to 


«=i  z + Vi 


7\  z + Vi 


ViXi).  (5.2.26) 


z*H{z  + Vi)  l\Y^j^-yXiXl:\  2 exp {-1-H(z)} 


»=i 


xt1  + f - f EUz  + V^)-1  - f tr(A~'%)  4-  j-H(z)  - \±H{z)] 

= (Z  + To)f  nLl  0 + To  + Vi)~2\  E?=l  7;;4xtxfl-i  exp{-^^y//(^  + Tq  )}. 


(5.2.27) 


Now  we  have  the  following  HB  model  based  on  ML-II  prior: 

(I)  Conditional  on  9,  /3,  and  r2,  Y)  -/V(0j,  V)),i  = 1,  • • ■ ,p,  where  the  V^s  are 
known  positive  constants; 

(II)  Conditional  on  /3,  and  r2,  9i  17~  N(xJ/3:  r2),  i = 1,  • • • ,p,  where  x1?  • • • , xp 
are  known  regression  vectors  of  dimension  s and  (3  is  sxl; 

(III)  Marginally,  j3  and  r2  are  mutually  independent  with  (3  ~ uniform^5)  and 
r2  having  a density  h(r2)  = (1  -e)/i0(r2)  +£<?(t2),  where  /i0(r2)  is  IG(a0,  ft)  density, 
with  c*o  > 0 and  /?o  > 0,  and  q is  uniform  (tq  , Tq  + z ),  with  z being  the  solution  of 

(5.2.27)  and  r02  = a0/(ft  + 1). 

It  is  clear  from  (5.2.20)  that 


h(r2\y) 


oc(r2)‘{nL,(r2  + K)-i}|X7'DXj-iexp[-iQrI(y)]{(l-e)h0(T2)+£f(r2)}. 


(5.2.28) 
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Now  writing  Ui  — Vi/(r2  + Vi)  (i  = 1,  • • • ,p),  using  (5.2.28),  and  the  iterated  formulas 
for  conditional  expectations  and  variances,  one  gets 

E[0i\y\  = E[E(di\y,r2)\y]  = E[(  1 - + U{xJ  f3\y\-  (5.2.29) 

and 


vm  y]  = V[E(6i\y,T2)\y]  + E[V(9i\y,T2)\y] 

= V[{1  - Ui)yi  + UiXJf3\y]  + E[r2Ui  + T2U2xJ{XTDX)-lxi\y] 

= Vpiivi  - xj(3)\y]  + E{Vi(  1 - Ui)  + W,(l  - Ui)xJ(XTDX)~lxi\y}. 

(5.2.30) 

Thus,  the  posterior  distribution  of  G under  the  ML-II  prior  is  obtained  using  (5.2.4) 
and  (5.2.28).  In  addition,  one  uses  (5.2.29)  and  (5.2.30)  to  find  the  posterior  means 
and  variances  of  under  this  prior.  Similarly,  by  using  the  iterated  formulas,  posterior 
covariances  may  be  obtained  as  well. 

Next  we  consider  the  problem  of  finding  the  range  of  the  posterior  mean  of  6i  over 
T in  (5.2.1).  Using  the  expression  h(r2\y)  oc  /(y|r2)h(r2),  we  have 

E\0i\v]  = Vi  ~ E[Ui(yi  - xj(3)\y] 

v _ Jo00  UtiVi  ~ xf f3)h(r2\y)dr2 
Jo00  h(r2\y)dT2  ‘ 

Simple  modifications  of  the  arguments  of  Sivaganesan  and  Berger  (1989)  or  Cano 
(1993)  leads  to  the  following  result 


sup(inf)£[0j|y]  = sup(inf)- 

/iGT  t 


A + ~ ui(Vi  ~ xj(3))f(y\r2)dr2 

E + f*2  f(y\T2)dr2 


(5.2.32) 


where 


B = - — - f f (y\r2)h0(r2)dr2 
£ Jo 


(5.2.33) 
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and 

2 — £ roo 

A = ViB — Ui(yi  - xJp)f(y\T2)h0(T2)dT2.  (5.2.34) 

o J 0 

The  above  sup  (inf)  can  be  obtained  by  numerical  optimization.  These  formulas  will 
be  used  in  Section  5.5  in  the  context  of  estimation  of  median  income  of  four-person 
families. 


5.3  Density  Ratio  Class 


In  this  section  we  consider  a class  of  priors,  introduced  by  DeRobertis  and  Hartigan 
(1981),  and  called  a density  ratio  class  by  Berger  (1990), 


^ l,u 


{h  : 


Mr2)  ^ Mr2) 


< 


h(r'2)  1(t'2) 


for  all  r2  and  r'2}, 


(5.3.1) 


where  l and  u are  two  bounded  nonnegative  functions  such  that  /(r2)  < u(r2)  for  all 
t2.  This  class  can  be  viewed  as  specifying  ranges  for  the  ratios  of  the  prior  density 
between  any  two  points.  By  taking  u = kho  and  l — ho,  we  have  very  interesting 
subclass, 

r*(M  = ih : ^ for  al1  r2  and  r'2}>  (5-3-2) 

where  k > 1 is  a constant.  This  class  may  be  thought  of  as  a neighborhood  around 
the  target  prior  /i0-  The  interpretation  is  that  the  odds  of  any  pair  of  points  are  not 
misspecified  by  more  than  a factor  of  k.  This  prior  is  especially  useful  when  h0  is  a 
default  prior  chosen  mainly  for  convenience. 

Because  of  the  expression  h(r2\y)  oc  f(y\T2)h(r2)  we  can  view  our  problem  as 
having  just  the  parameter  r2,  h being  the  prior  and  f{y\r2)  the  likelihood.  Since 


B[9i\y]  =Vi-E[ 


Vi 


r2  + Vi 


{Vi 


x. 


’(£ 


lr2  + Vt 


xixfr1^ 


T 


2=1 


T2  + Vi 


ViXi)M,  (5-3.3) 


our  problem  reduces  to  finding 
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sup  (inf)  E[b{r2)\y]  (5.3.4) 

fcerf(fto) 


where  b{r2)  = y < - xf(Ef=1 

Wasserman  and  Kadane  (1992)  have  developed  a Monte  Carlo  technique  which 
can  be  used  to  bound  posterior  expectations  over  the  density  ratio  class.  Wasserman 
(1992)  has  shown  that  the  set  of  posteriors  obtained  by  applying  Bayes’  theorem  to  the 
density  ratio  class  T%(h0)  is  the  density  ratio  class  T^(h°y):  where  h°y(r2)  = h0{r2\y ) 
is  the  posterior  corresponding  to  hQ.  To  see  this  all  we  need  to  do  is  to  write 


If(*£)  = {hy  : 


Mr2)  < ,,  hy  (r2) 

hy{r'2)  ~ K hy{T'2) 


for  all  t2  and  t'2}, 


(5.3.5) 


and  observe  - Ey\r2)h{r2)  , h0{r2\y) 

ousei  ve  fc(T,2|y)  — f(y\T'2)h(T’2)  ancl  '■ 


ho{j'2\y)  ~ f(y\r'2)h0(r'2) 


f(y\T'2)ho{r2)  r h at  h G TR(hn)  is 

f(V\T'2Vin(T'2\  S0  tnat  n 1 k \no)  1S 


equivalent  to  hy  G TR(hy),  where  hy(r2)  = h(r2\y).  Wasserman  (1992)  calls  this  as 
Bayes  invariance  property.  Hence,  to  bound  the  posterior  expectation  of  5(r2),  we 
only  need  bound  the  expectation  of  6(r2)  on  Tf  (h°y).  To  do  so,  we  will  need  to  draw 
a sample  from  the  posterior  hy.  Following  Wasserman  and  Kadane  (1992),  we  can 
rely  on  recent  sampling-based  methods  for  Bayesian  inference.  Note  in  this  case  that 


V^2|  y)  oc  ( r2)2  {J[(t2  + Vi)~2}\XT  DX\ 


t=i 


'2  exp[-iQT2(y)]  h0{r2).  (5.3.6) 


Let  t2 , • • • , T^f  be  a random  sample  from  /i0(r2|y).  Let  b{  = b(r2),i  = 1 
and  let  6p)  < &(2)  < • • • < 5( n)  be  the  corresponding  order  statistics.  Also,  let 
ci  = ~Kt2)i  i — 1) ' ’ * ) N and  let  cp)  < C(2)  < • • • < C(N)  be  the  corresponding  order 

statistics.  Let  b = J2iL\bi/N  and  c = Eili  Ci/AL  Then  following  Wasserman  and 
Kadane  (1992),  we  get 


sup  E[b{r2)\y]  = sup  E[b{r2)\y) 
her*(h0)  hyer  *(h°y) 
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« max  { (1 

1 <i<N  K 


j)4  + l)-‘{4E»S#  + ‘)  (5-3.7) 

3=i 


and 


inf  E[b(r2)\y\  = inf  £[5(r2)|y] 
*6rf(fco)  hy^vk(hy) 


N 


- - Ti)A  + 1}  '{AEco./^^}  (5.3.8) 


TV 


J = * 


where  A = k — 1.  This  gives  the  posterior  bound  for  a given  k. 

To  generate  the  sample  from  hy , we  use  Gibbs  sampling.  The  Gibbs  sampling 
analysis  is  based  on  the  following  posterior  distributions  : 

(i) /3  | y,0,z  ~ Ns({XTX)~1XT0,  z-\XTX)~1)- 

(ii)  I y,0,z  ~ iV  ((1  - Bi)yi  + B^f/3  , 

(iii) r2  | y,0,0  ~IG(a0  + |E™i(^-*r/3)2,  Jp  + fi,); 
where  Z = (r2)”1  and  Bz  = z/(z  + &),  where  & = 1/V$,  * = 1,  • - - ,p. 

Note  that  our  target  is  to  draw  samples  from  /i0(r2|y).  However,  as  is  well  known 
in  the  Gibbs  sampling  literatures,  the  histogram  of  the  (r2)^,  t = 1,2  ,■■■  ,q,  samples 
drawn  from  the  conditionals  [r2  | y,  0 , 0]  converges  to  the  conditional  distribution 
[r  | y]  as  q — ■*  oo.  This  along  with  (5.3.7)  and  (5.3.8)  facilitates  the  computations  of 
upper  and  lower  bounds  of  E[b(r2)\y\. 


5.4  Class  of  Uniform  Priors 


In  usual  HB  model  in  our  setting,  one  might  use  the  diffuse  prior  on  r2.  Instead, 
we  consider  the  class  of  uniform  priors  on  r2  with  constraints  of  the  form 


T = {t2  : t2  < t2  < r2}, 


(5.4.1) 
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where  t\  and  r22  are  arbitrary  nonnegative  numbers  such  that  r2  < r|.  This  class  of 
priors  is  attractive  because  of  its  mathematical  convenience,  and  indeed  give  a good 
enough  class  of  priors  for  an  honest  robustness  check.  Classes  of  conjugate  priors 
having  parameters  in  certain  ranges  have  been  studied  by  Learner  (1982)  and  Polasek 
(1985). 

From  (5.2.6)  and  (5.2.15),  we  have 
sup  (infJEftly.T2] 

Tj2<r2<r| 


- Vi  TJrn/<r„2(suP)^TT yivi  ~ x r (E zrrvXiXT)  J(E 


T 


Ti  <r2<r| 


r2  + K: 


i=  1 


r2  + V 


ViXi)}. 


(5.4.2) 

Hence,  we  can  find  the  range  of  the  posterior  quantity  E[6i\y,r2]  over  T in  (5.4.1), 
and  investigate  the  sensitivity  with  different  choices  of  r2  and  r|. 


5.5  An  Example 


In  this  section  we  illustrate  the  methods  suggested  in  preceding  sections  with  an 
analysis  of  real  data  set.  The  data  set  is  related  to  the  estimation  of  median  income 
for  foui -person  families  by  states.  To  start  with,  we  briefly  give  a background  of  this 
problem. 

The  U.S.  Department  of  Health  and  Human  Services  (HHS)  administers  a program 
of  energy  assistance  to  low-income  families.  Eligibility  for  the  program  is  determined 
by  a formula  where  the  most  important  variable  is  an  estimate  of  the  current  median 
income  for  four-person  families  by  states  (the  50  states  and  the  District  of  Columbia). 

The  Bureau  of  the  Census  provides  such  estimates  to  the  HHS  annually  since  the 
latter  part  of  the  1970’s  using  a linear  regression  methodology.  The  basic  source  of 
the  data  is  the  annual  demographic  supplement  to  the  March  Sample  of  the  Current 
Population  Survey  (CPS)  which  provides  median  income  for  four-person  families  for 
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the  preceding  year.  In  addition,  once  every  ten  years,  similar  figures  are  obtained 
from  the  decennial  census  for  the  calendar  year  immediately  preceding  the  census 
year.  The  latter  set  of  estimates  is  believed  to  be  much  more  accurate  and  serve  as 
the  “gold  standard”  against  which  other  estimates  need  to  be  tested. 

Direct  use  of  CPS  estimates  was  found  undesirable  because  of  the  large  coefficient 
of  variation  associated  with  them  (cf  Fay  (1987)).  The  regression  method  used  by  the 
Bureau  of  the  Census  was  intended  to  rectify  this  drawback.  The  method  that  is  used 
since  1985  is  as  follows.  The  current  year  CPS  estimates  of  statewide  median  incomes 
of  four-person  families  are  used  as  dependent  variables  in  a linear  regression.  In 
addition  to  the  intercept  term,  the  regression  equation  uses  as  independent  variables 
the  base  year  census  median(b),  and  the  adjusted  census  median(c),  where  census 
median(b)  represents  the  median-income  of  four-person  families  in  the  state  for  the 
base  year  b from  the  most  recently  available  decennial  census.  The  adjusted  census 
median  (c)  is  obtained  from  the  formula 

Adjusted  census  median(c)  = [BEA  PCI(c)/BEA  PCI(6)] 

x Census  median (5).  (5.5.1) 

In  the  above  BEA  PCI(c)  and  BEA  PCI(6)  represent  respectively  estimates  of  per- 
capita  income  produced  by  the  Bureau  of  Economic  Analysis  of  the  U.S.  Department 
of  Commerce  for  the  current  year  c,  and  the  base  year  5,  respectively.  Thus  (5.5.1) 
attempts  to  adjust  the  base  year  census  median  by  the  proportional  growth  in  the 
BEA  PCI  to  arrive  at  the  current  year  adjusted  median.  The  inclusion  of  the  census 
median (b)  as  a second  independent  variable  is  to  adjust  for  any  possible  overstatement 
of  the  effect  of  change  in  BEA  PCI  upon  the  median  income  of  four-person  families. 

Finally,  a weighted  average  of  the  CPS  sample  estimate  of  the  current  median 
income  and  the  regression  estimates  is  obtained,  weighting  the  sample  estimates  in- 
versely proportional  to  the  sampling  variances  and  the  regression  estimates  inversely 
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proportional  to  the  model  error  due  to  the  lack  of  fit  for  the  census  values  of  1979  by 
the  model  with  1969  used  as  the  base  year. 

The  data  consist  of  1*,  the  sample  median  income  of  four-person  families  in  state 
i and  associated  variance  V{  (i  = 1,  • • • , 51).  The  true  medians  corresponding  to  Y^s 
are  denoted  by  6>j’s  respectively.  The  design  matrix  X = (xu  ■ ■ ■ , x51)r  is  of  the  form 

xj  = (1  xu  xi2) , (i  = 1,  • • • , 51),  (5.5.2) 

where  Xu  and  X{2  denote  respectively  the  adjusted  census  median  income  and  the 
base-year  census  median  income  for  four-person  families  in  the  ith  state. 

We  consider  the  HB  model  as  given  in  Section  5.1.  We  choose  Y to  be  e- 
contamination  class  of  priors  where  the  contamination  class  includes  all  unimodal 
distributions.  We  find  RB  estimates  of  statewise  median  incomes  of  four-person  fam- 
ilies for  1989  using  1979  as  the  base  year  based  on  ML-II  priors.  In  finding  RB 
estimates,  we  have  used  (5.2.29)  and  (5.2.30)  with  a0=l,  A)  =10  and  e=.l.  The  al- 
ternatives in  our  setting  are  HB  and  EB  estimates  of  6{.  For  the  HB  analysis,  we  use 
diffuse  prior  on  r2  instead  of  using  class  of  priors.  The  EB  method  is  due  to  Morris 
(1983b)  which  uses  estimates  of  r2  and  (3  rather  than  assigning  any  prior  distribution 
on  hyperparameters.  Since  in  addition  to  the  sample  estimates,  the  1989  incomes 
are  also  available  from  the  1990  census  data,  we  compare  all  three  Bayes  estimates 
against  the  1990  census  figures,  treating  the  latter  as  the  “truth”.  Table  5.1  provides 
the  “truth”,  the  CPS  estimates  and  three  Bayes  estimates,  whereas  Table  5.2  provides 
the  standard  errors  associated  with  three  Bayes  estimates. 

Now  these  estimates  are  compared  according  to  several  criteria.  Suppose  eiTR 
denotes  the  true  median  income  for  the  ith  state,  and  e*  any  estimate  of  eiTR.  Then 
for  the  estimate  e = (e: , • • • , e51  )T  of  eTR  = (elTR , • • • , e5lTR  )T,  let 


51 

Average  Relative  Bias  = (51)-1  ^ |e<  - eiTR\ /eiTR 

i=i 
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51 

Average  Squared  Relative  Bias  = (51)_1  ]T  \a  - eiTR\ 2 /e-T 

i=l 

51 

Average  Absolute  Bias  = (51)_1  ^ \e{  - eiTJ 

i= 1 

51 

Average  Squared  Deviation  = (51 )-1  ^ (et  - e^)2 

t=i 

The  above  quantities  are  provided  in  Table  5.3  for  all  three  Bayes  estimates. 

From  Table  5.1,  one  can  see  that  three  Bayes  estimates  of  the  small  area  means  are 
quite  close  to  each  other.  But  from  Table  5.2,  it  appears  that  the  estimated  standard 
errors  given  by  RB  method  are  larger  than  those  given  by  HB  or  EB  methods.  Table 
5.3  indicates  that  the  EB  estimates  are  the  best  and  the  HB  estimates  are  the  second 
best  under  all  criteria  for  this  data  set.  As  anticipated  also,  the  HB  standard  errors 
are  slightly  bigger  than  the  EB  standard  errors.  As  is  well-known,  this  phenomenon 
can  be  explained  by  the  fact  that  unlike  the  EB  estimators,  the  HB  estimators  take 
into  account  the  uncertainty  for  estimating  the  prior  parameters. 

Next,  we  find  the  ranges  of  the  posterior  means  for  the  small  areas  under  e- 
contamination  class.  Table  5.4a  and  Table  5.4b  provide  the  ranges  of  posterior  means 
under  e-contamination  class  when  (ao>  A)  — (1,  10)  and  (ao>  A))  = (7,  3)  respectively. 
For  (a0,  A>)  = (1)  10),  the  ranges  are  fairly  small  and  robustness  seems  to  be  achieved 
using  this  class  for  all  e values.  But  for  (of0,  A) ) — (7,  3),  the  ranges  are  relatively 
wider  in  comparison  with  (ao,  A)  = (1,  10).  As  one  might  expect,  the  choice  of  ho, 
that  is,  the  elicited  inverse  gamma  prior,  seems  to  have  some  effect  on  the  ranges  of 
the  posterior  means  for  e-contamination  class.  Note  that  the  inverse  gamma  prior  of 
t2  with  (a0,  A)  = (1>  10)  has  coefficient  of  variation  1/8  compared  to  1 with  (a0, 
A)  = (7,  3).  Although  the  two  priors  have  very  similar  tails,  the  former  is  much 
flatter  than  the  latter  even  for  small  values  of  r2.  This  suggests  that  the  bigger  the 
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tauA2 


Figure  5.1.  The  IG(1,  10)  Prior 

coefficient  of  variation  of  the  assumed  inverse  gamma  prior,  the  wider  is  the  range  of 
the  posterior  means  of  the  Of  s. 

We  also  find  the  ranges  of  the  posterior  means  under  density  ratio  classes.  In 
computing  the  bounds,  we  have  considered  Gibbs  sampler  with  10  independent  se- 
quences, each  with  a sample  of  size  1000  with  a burn-in  sample  of  another  1000.  We 
adopt  the  basic  approach  of  Gelman  and  Rubin  (1992)  to  monitor  the  convergence  of 
the  Gibbs  sampler.  We  have  obtained  R values  (the  potential  scale  reduction  factors) 
corresponding  to  the  estimand  6t  based  on  10  x 1000  = 10000  simulated  values.  The 
fact  that  all  the  point  estimates  R are  equal  to  1 as  well  as  the  near  equality  of  these 
point  estimates  and  the  corresponding  97.5%  quantiles  suggests  that  convergence  is 
achieved  in  the  Gibbs  sampler.  Table  5.5a  and  Table  5.5b  provide  the  ranges  of  the 
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tauA2 

Figure  5.2.  The  IG(7,  3)  Prior 

posterior  means  under  density  ratio  classes  when  (a0,  Po)  = (1,  10)  and  (a0,  Po)  = 
(7,  3)  respectively.  Note  however  that  here  the  bounds  given  for  IG(7,  3)  are  much 
closer  than  the  ones  for  IG(1,  10).  The  intuitive  explanation  for  this  phenomenon 

is  that  while  can  be  extremely  large  for  certain  choices  ( r 2,r/2)  corresponding 

to  the  IG(1,  10)  prior,  the  ratio  is  more  under  control  for  the  IG(7,  3)  prior. 

This  density  ratio  classes  are  very  convenient  to  represent  vague  prior  knowledge  and 
robustness  seems  to  be  achieved  using  the  IG(7,  3)  prior. 

Finally,  we  consider  the  ranges  of  the  posterior  quantities  E[$i\y,  r2]  over  the  class 
of  uniform  priors.  From  Table  5.6,  we  can  see  that  the  ranges  are  not  sensitive  to 
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the  choice  of  the  upper  bound  of  r2.  Also,  for  most  of  the  states,  the  ranges  of  the 
posterior  means  do  not  seem  to  be  too  wide. 


Table  5.1  Median  Income  Estimates  for  4-Person  Family  in  1989  (in  Dollars) 


State 

Census 

CPS 

RB 

HB 

EB 

1 

38035 

37700 

38027 

38004 

38011 

2 

46709 

52720 

48652 

48389 

48222 

3 

39601 

41489 

40319 

40170 

40109 

4 

51498 

55729 

53294 

52986 

52865 

5 

45072 

47106 

43905 

43727 

43607 

6 

55115 

55048 

53224 

53092 

53015 

7 

45757 

43434 

43800 

43853 

43869 

8 

54398 

53128 

53292 

53300 

53310 

9 

40833 

40990 

40820 

40780 

40774 

10 

40092 

44241 

42874 

42657 

42596 

11 

39727 

40297 

39684 

39624 

39594 

12 

43960 

42590 

42762 

42792 

42800 

13 

43166 

43260 

42973 

42924 

42913 

14 

40635 

42226 

41196 

41065 

41015 

15 

43212 

45432 

43653 

43507 

43425 

16 

36880 

38711 

37871 

37761 

37720 

17 

37674 

39232 

39012 

38981 

38969 

18 

33426 

36966 

35381 

35171 

35093 

19 

32289 

32610 

32447 

32406 

32398 

20 

36519 

40894 

38789 

38543 

38438 

21 

37997 

42781 

39232 

38960 

38798 

22 

46819 

39819 

41551 

41651 

41724 

23 

51082 

48252 

49791 

49846 

49898 

24 

43279 

36587 

40165 

40294 

40411 

25 

43223 

49718 

45988 

45775 

45632 

Table  5.1  (continued) 


State 

Census 

CPS 

RB 

HB 

EB 

26 

31071 

31604 

32124 

32179 

32204 

27 

37139 

37423 

37555 

37562 

37569 

28 

35901 

31834 

34314 

34510 

34627 

29 

39036 

45876 

41845 

41612 

41455 

30 

38296 

37194 

37735 

37828 

37849 

31 

32566 

32186 

33603 

33712 

33778 

32 

35383 

31327 

33663 

33907 

34022 

33 

34153 

32245 

34021 

34179 

34265 

34 

29987 

31330 

31863 

31900 

31925 

35 

30673 

33317 

33015 

32952 

32935 

36 

32023 

36219 

35230 

35147 

35101 

37 

33111 

40358 

36251 

35997 

35827 

38 

36513 

32679 

33806 

34008 

34053 

39 

32677 

29586 

32177 

32446 

32574 

40 

33385 

32227 

33025 

33110 

33149 

41 

36619 

38931 

36089 

35917 

35798 

42 

40956 

36732 

39005 

39178 

39284 

43 

31046 

29761 

31261 

31385 

31456 

44 

37322 

38969 

38885 

38863 

38855 

45 

36567 

37938 

37056 

36974 

36932 

46 

41289 

34579 

38180 

38517 

38694 

47 

41870 

42392 

41666 

41587 

41552 

48 

37596 

41419 

39505 

39336 

39245 

49 

46088 

41121 

41669 

41765 

41788 

50 

51127 

41379 

46512 

46890 

47123 

51 

47341 

44999 

44395 

44366 

44347 

Table  5.2  Estimated  Standard  Errors  of  Median  Income  Estimates 
for  4-Person  Family  by  State  in  1989  (in  Dollars) 


State 

SD.RB 

SD.HB 

SD.EB 

1 

2300 

2207 

2152 

2 

2594 

2526 

2470 

3 

2120 

2054 

2005 

4 

2009 

2029 

1992 

5 

2546 

2437 

2352 

6 

2557 

2458 

2388 

7 

1361 

1336 

1329 

8 

1696 

1661 

1649 

9 

1399 

1366 

1356 

10 

1590 

1595 

1567 

11 

2119 

2028 

1976 

12 

1533 

1494 

1482 

13 

1420 

1390 

1381 

14 

1840 

1790 

1761 

15 

2227 

2141 

2087 

16 

1818 

1766 

1739 

17 

2328 

2211 

2131 

18 

1836 

1816 

1787 

19 

1759 

1706 

1683 

20 

1950 

1929 

1895 

21 

2285 

2239 

2209 

22 

2349 

2240 

2172 

23 

2675 

2545 

2449 

24 

2558 

2435 

2368 

25 

2536 

2441 

2372 

Table  5.2  (continued) 


State 

SD.RB 

SD.HB 

SD.EB 

26 

1833 

1772 

1748 

27 

1357 

1327 

1321 

28 

2141 

2082 

2058 

29 

2540 

2450 

2383 

30 

1271 

1257 

1249 

31 

2102 

2023 

1986 

32 

1934 

1917 

1903 

33 

2025 

1965 

1938 

34 

1905 

1840 

1815 

35 

1932 

1868 

1836 

36 

2258 

2160 

2098 

37 

2447 

2378 

2338 

38 

1393 

1412 

1385 

39 

1980 

1967 

1948 

40 

1822 

1765 

1741 

41 

2504 

2432 

2405 

42 

2233 

2155 

2109 

43 

2086 

2015 

1981 

44 

2584 

2440 

2316 

45 

2205 

2108 

2047 

46 

2132 

2139 

2124 

47 

1952 

1884 

1850 

48 

2164 

2092 

2046 

49 

1410 

1388 

1376 

50 

2710 

2718 

2711 

51 

2599 

2460 

2346 

98 


Table  5.3  Average  Relative  Bias,  Average  Squared  Relative  Bias,  Average 
Absolute  Bias,  and  Average  Squared  Deviations  of  the  Estimates 


Estimate 

Avg  Rel  Bias 

Avg  Sq  Rel  Bias 

Avg  Abs  Bias 

Avg  Sq  Dev 

RB 

.0399 

.0025 

1599.98 

4115162.29 

HB 

.0376 

.0022 

1507.79 

3690921.55 

EB 

.0365 

.0021 

1464.39 

3489870.43 
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Table  5.4a  Range  of  Posterior  Mean  of  6i  for  e-contamination  Class 
when  cxq  — 1 and  /30  = 10 

(In  Dollars) 


State 

£ = 

.1 

£ = 

.2 

£ = 

.5 

£ = 

= 1 

inf 

sup 

inf 

sup 

inf 

sup 

inf 

sup 

1 

38022 

38028 

38018 

38029 

38007 

38031 

37991 

38036 

2 

48572 

48677 

48502 

48705 

48324 

48800 

48081 

49112 

3 

40277 

40330 

40241 

40344 

40147 

40388 

40018 

40528 

4 

53208 

53318 

53133 

53345 

52940 

53436 

52673 

53722 

5 

43850 

43922 

43802 

43942 

43680 

44009 

43513 

44229 

6 

53184 

53235 

53150 

53249 

53062 

53295 

52941 

53444 

7 

43796 

43814 

43791 

43827 

43777 

43859 

43732 

43905 

8 

53291 

53295 

53289 

53297 

53285 

53304 

53271 

53313 

9 

40810 

40822 

40801 

40825 

40777 

40834 

40745 

40862 

10 

42816 

42890 

42765 

42907 

42633 

42966 

42449 

43145 

11 

39667 

39689 

39651 

39695 

39612 

39714 

39559 

39776 

12 

42760 

42770 

42757 

42777 

42749 

42795 

42724 

42821 

13 

42960 

42976 

42949 

42980 

42920 

42992 

42879 

43030 

14 

41160 

41206 

41128 

41218 

41046 

41257 

40933 

41377 

15 

43610 

43666 

43572 

43680 

43476 

43730 

43344 

43891 

16 

37841 

37879 

37814 

37889 

37746 

37921 

37651 

38021 

17 

39004 

39015 

38996 

39018 

38976 

39027 

38949 

39056 

18 

35323 

35397 

35272 

35415 

35141 

35477 

34961 

35667 

19 

32436 

32450 

32427 

32453 

32403 

32463 

32369 

32493 

20 

38720 

38809 

38659 

38831 

38503 

38906 

38289 

39142 

100 


Table  5.4a  (continued) 


(In  Dollars) 


£ = 

.1 

£ = 

.2 

£ = 

.5 

£ = 

= 1 

State 

inf 

sup 

inf 

sup 

inf 

sup 

inf 

sup 

21 

39150 

39256 

39079 

39284 

38897 

39380 

38650 

39689 

22 

41542 

41583 

41530 

41610 

41492 

41679 

41364 

41773 

23 

49785 

49810 

49778 

49826 

49753 

49867 

49669 

49921 

24 

40151 

40209 

40135 

40246 

40078 

40341 

39886 

40468 

25 

45922 

46008 

45865 

46032 

45719 

46111 

45520 

46374 

26 

32120 

32140 

32114 

32153 

32097 

32189 

32043 

32237 

27 

37554 

37557 

37553 

37559 

37550 

37565 

37539 

37572 

28 

34297 

34373 

34277 

34424 

34208 

34555 

33987 

34733 

29 

41773 

41867 

41710 

41892 

41551 

41979 

41334 

42265 

30 

37728 

37759 

37721 

37781 

37697 

37837 

37625 

37915 

31 

33593 

33636 

33582 

33665 

33544 

33737 

33419 

33836 

32 

33643 

33733 

33620 

33794 

33542 

33951 

33298 

34166 

33 

34007 

34067 

33991 

34108 

33938 

34212 

33768 

34354 

34 

31859 

31874 

31855 

31884 

31842 

31909 

31797 

31943 

35 

32998 

33020 

32983 

33025 

32945 

33041 

32892 

33092 

36 

35205 

35237 

35184 

35245 

35129 

35273 

35055 

35364 

37 

36173 

36275 

36104 

36303 

35930 

36398 

35694 

36708 

38 

33792 

33859 

33777 

33905 

33726 

34026 

33570 

34194 

39 

32155 

32254 

32129 

32322 

32043 

32495 

31772 

32733 

40 

33018 

33049 

33010 

33071 

32984 

33125 

32899 

33200 

Table  5.4a  (continued) 


(In  Dollars) 


£ = 

= .1 

£ = 

= .2 

£ = 

= .5 

£ = 

= 1 

State 

inf 

sup 

inf 

sup 

inf 

sup 

inf 

sup 

41 

36036 

36106 

35989 

36125 

35871 

36190 

35710 

36404 

42 

38989 

39057 

38971 

39103 

38910 

39219 

38711 

39377 

43 

31250 

31298 

31237 

31330 

31194 

31412 

31057 

31525 

44 

38879 

38886 

38873 

38888 

38860 

38894 

38841 

38913 

45 

37032 

37063 

37011 

37071 

36958 

37098 

36885 

37184 

46 

38151 

38278 

38118 

38365 

38006 

38585 

37649 

38887 

47 

41644 

41672 

41624 

41679 

41574 

41704 

41505 

41781 

48 

39456 

39520 

39412 

39537 

39301 

39593 

39149 

39775 

49 

41662 

41694 

41654 

41716 

41630 

41774 

41554 

41854 

50 

46477 

46626 

46437 

46726 

46302 

46980 

45864 

47327 

51 

44386 

44398 

44378 

44401 

44358 

44411 

44331 

44447 

102 


Table  5.4b  Range  of  Posterior  Mean  of  9{  for  e-contamination  Class 

when  a0  — 7 and  /30  = 3 

(In  Dollars) 


£ = 

.1 

£ = 

.2 

£ = 

.5 

£ = 

= 1 

State 

inf 

sup 

inf 

sup 

inf 

sup 

inf 

sup 

1 

37895 

38034 

37890 

38034 

37881 

38034 

37813 

38034 

2 

48389 

51266 

48389 

51311 

48389 

51389 

48389 

51931 

3 

40170 

41215 

40170 

41225 

40170 

41243 

40170 

41355 

4 

52986 

55129 

52986 

55151 

52986 

55188 

52986 

55430 

5 

43727 

45865 

43727 

45902 

43727 

45966 

43727 

46417 

6 

53092 

54424 

53092 

54444 

53092 

54478 

53092 

54713 

7 

43522 

43853 

43519 

43853 

43513 

43853 

43478 

43853 

8 

53179 

53300 

53177 

53300 

53174 

53300 

53154 

53300 

9 

40780 

40962 

40780 

40964 

40780 

40966 

40780 

40977 

10 

42657 

43938 

42657 

43949 

42657 

43968 

42657 

44092 

11 

39624 

40123 

39624 

40129 

39624 

40139 

39624 

40208 

12 

42623 

42792 

42622 

42792 

42619 

42792 

42606 

42792 

13 

42924 

43195 

42924 

43198 

42924 

43202 

42924 

43228 

14 

41065 

41971 

41065 

41980 

41065 

41996 

41065 

42098 

15 

43507 

44876 

43507 

44895 

43507 

44926 

43507 

45141 

16 

37761 

38506 

37761 

38514 

37761 

38527 

37761 

38609 

17 

38981 

39191 

38981 

39192 

38981 

39196 

38981 

39214 

18 

35171 

36585 

35171 

36599 

35171 

36623 

35171 

36777 

19 

32406 

32596 

32406 

32597 

32406 

32598 

42306 

32606 

20 

38543 

40353 

38543 

40372 

38543 

40405 

38543 

40622 

103 


Table  5.4b  (continued) 


(In  Dollars) 


£ = 

.1 

£ = 

= .2 

£ = 

= .5 

£ = 

= 1 

State 

inf 

sup 

inf 

sup 

inf 

sup 

inf 

sup 

21 

38960 

41642 

38960 

41680 

38960 

41744 

38960 

42181 

22 

40445 

41651 

40426 

41651 

40392 

41651 

40158 

41651 

23 

48922 

49846 

48903 

49846 

48870 

49846 

48634 

49846 

24 

38170 

40294 

38126 

40294 

38049 

40294 

37494 

40294 

25 

45775 

48297 

45775 

48340 

45775 

48414 

45775 

48934 

26 

31749 

32179 

31743 

32179 

31735 

32179 

31678 

32179 

27 

37467 

37562 

37465 

37562 

37463 

37562 

37446 

37562 

28 

32613 

34510 

32587 

34510 

32543 

34510 

32242 

34510 

29 

41612 

44350 

41612 

44397 

41612 

44476 

41612 

45035 

30 

37314 

37828 

37309 

37828 

37302 

37828 

37253 

37828 

31 

32637 

33712 

32622 

33712 

32597 

33712 

32423 

33712 

32 

31971 

33907 

31948 

33907 

31910 

33907 

31656 

33907 

33 

32772 

34179 

32754 

34179 

32724 

34179 

32518 

34179 

34 

31505 

31900 

31499 

31900 

31490 

31900 

31423 

31900 

35 

32952 

33281 

32952 

33283 

32952 

33286 

32952 

33304 

36 

35147 

35915 

35147 

35925 

35147 

35943 

35147 

36061 

37 

35997 

38886 

35997 

38932 

35997 

39011 

35997 

39561 

38 

32916 

34008 

32907 

34008 

32892 

34008 

32795 

34008 

39 

30299 

32446 

30274 

32446 

30231 

32446 

29950 

32446 

40 

32447 

33110 

32439 

33110 

32426 

33110 

32340 

33110 

104 


Table  5.4b  (continued) 


(In  Dollars) 


£ = 

.1 

£ = 

.2 

£ - 

.5 

£ - 

= 1 

State 

inf 

sup 

inf 

sup 

inf 

sup 

inf 

sup 

41 

35917 

37920 

35917 

37952 

35917 

38007 

35917 

38386 

42 

37457 

39178 

37433 

39178 

37392 

39178 

37113 

39178 

43 

30223 

31385 

30208 

31385 

30181 

31385 

30003 

31385 

44 

38863 

38985 

38863 

38985 

38863 

38986 

38863 

38986 

45 

36974 

37682 

36974 

37691 

36974 

37706 

36974 

37806 

46 

35615 

38517 

35579 

38517 

35518 

38517 

35111 

38517 

47 

41587 

42194 

41587 

42201 

41587 

42213 

41587 

42291 

48 

39336 

40848 

39336 

40867 

39336 

40900 

39336 

41123 

49 

41236 

41765 

41232 

41765 

41224 

41765 

41177 

41765 

50 

43049 

46890 

42994 

46890 

42900 

46890 

42261 

46890 

51 

44366 

44737 

44366 

44745 

44366 

44757 

44366 

44849 

Table  5.5a  Range  of  Posterior  Mean  of  0*  for  Density  Ratio  Class 
when  a0  = 1 and  /30  = 1 0 


(In  Dollars) 


K = 

=2 

K= 

=6 

K= 

=10 

State 

inf 

sup 

inf 

sup 

inf 

sup 

1 

35339 

38450 

31228 

41677 

28887 

43795 

2 

50698 

56652 

43881 

63947 

39398 

68553 

3 

38802 

41828 

34739 

44414 

32550 

46181 

4 

53020 

56279 

48577 

59580 

45838 

61773 

5 

46062 

51233 

40404 

57775 

36866 

61899 

6 

51710 

57020 

44903 

62638 

40688 

66174 

7 

43001 

44363 

41467 

46142 

40538 

47298 

8 

49985 

53007 

45643 

55203 

42826 

56608 

9 

40516 

42089 

38709 

43999 

37604 

45166 

10 

43398 

45319 

41044 

47518 

39613 

48883 

11 

37303 

40512 

33025 

42826 

30525 

44454 

12 

41371 

43116 

39051 

44876 

37604 

45953 

13 

42097 

43464 

40266 

44660 

39149 

45440 

14 

40704 

43328 

37290 

46100 

35206 

47813 

15 

43847 

47653 

39244 

51952 

36447 

54577 

16 

37643 

39925 

35011 

42709 

33515 

44577 

17 

38530 

42162 

34646 

46930 

32344 

49980 

18 

36326 

39099 

33501 

42840 

31995 

45223 

19 

31955 

33558 

29868 

35135 

28640 

36166 

20 

40599 

43778 

37207 

48068 

35120 

50698 

Table  5.5a  (continued) 


(In  Dollars) 


K: 

=2 

K: 

=6 

K= 

=10 

State 

inf 

sup 

inf 

sup 

inf 

sup 

21 

40447 

44625 

34844 

49190 

31459 

52134 

22 

38674 

41844 

35025 

45539 

32898 

47861 

23 

44755 

50078 

37859 

55709 

33486 

59205 

24 

35982 

39999 

31955 

45620 

29616 

49322 

25 

45929 

50992 

39207 

56164 

35095 

59375 

26 

32502 

35015 

30717 

38794 

29529 

41078 

27 

36603 

37797 

34922 

39062 

33914 

39876 

28 

30289 

32759 

26979 

35041 

25025 

36313 

29 

43997 

49304 

38161 

55744 

34661 

59721 

30 

36419 

38041 

34435 

40026 

33223 

41280 

31 

30357 

32847 

27069 

35016 

25162 

36300 

32 

29975 

32112 

27258 

34108 

25625 

35265 

33 

30063 

32551 

26821 

34397 

24807 

35433 

34 

30572 

32312 

28396 

34092 

27160 

35180 

35 

32168 

34047 

29700 

35859 

28302 

37036 

36 

34516 

37600 

30761 

40914 

28736 

43168 

37 

39132 

42928 

34801 

47720 

32348 

50848 

38 

32353 

33541 

31025 

35012 

30244 

35953 

39 

30363 

33111 

28185 

37075 

26887 

39558 

40 

30740 

32712 

28236 

34389 

26795 

35371 

Table  5.5a  (continued) 


(In  Dollars) 


K: 

=2 

K: 

=6 

K= 

=10 

State 

inf 

sup 

inf 

sup 

inf 

sup 

41 

36997 

40490 

32529 

44172 

29844 

46531 

42 

33753 

36833 

29626 

38937 

27196 

40405 

43 

28368 

30490 

25535 

32384 

23824 

33416 

44 

37026 

41695 

31871 

47537 

29024 

51430 

45 

37086 

40774 

32974 

45572 

30527 

48652 

46 

32426 

35104 

28590 

37595 

26181 

39108 

47 

40186 

42782 

36748 

45140 

34695 

46746 

48 

39534 

42560 

35585 

45807 

33270 

47904 

49 

40087 

41610 

38018 

43146 

36729 

44090 

50 

39459 

43772 

34626 

49113 

31790 

52690 

51 

43124 

47568 

37890 

52609 

34841 

55692 

Table  5.5b  Range  of  Posterior  Mean  of  0*  for  Density  Ratio  Class 
when  a0  = 7 and  (30  — 3 


(In  Dollars) 


K= 

=2 

K= 

=6 

K= 

=10 

State 

inf 

sup 

inf 

sup 

inf 

sup 

1 

37415 

38473 

36148 

39852 

35357 

40744 

2 

52118 

54832 

48964 

58092 

46912 

60068 

3 

41298 

42224 

40298 

43509 

39695 

44346 

4 

55216 

56322 

53884 

57565 

53049 

58319 

5 

45976 

48628 

42561 

51532 

40351 

53295 

6 

54653 

56561 

52646 

58987 

51472 

60497 

7 

43354 

43693 

42996 

44131 

42787 

44400 

8 

52955 

53533 

52340 

54266 

51985 

54722 

9 

40934 

41436 

40438 

42129 

40149 

42580 

10 

44043 

44705 

43282 

45526 

42820 

46046 

11 

39649 

40806 

38300 

42163 

37523 

43059 

12 

42313 

42728 

41805 

43188 

41510 

43490 

13 

43080 

43459 

42626 

43904 

42345 

44190 

14 

41836 

42791 

40672 

43889 

39957 

44551 

15 

44856 

46616 

42781 

48729 

41477 

50093 

16 

38925 

39632 

38335 

40682 

37989 

41333 

17 

39843 

41728 

38580 

44434 

37818 

46024 

18 

36965 

37658 

36262 

38638 

35846 

39257 

19 

32470 

32886 

31957 

33348 

31653 

33667 

20 

40943 

41888 

40001 

43150 

39446 

43893 

Table  5.5b  (continued) 


(In  Dollars) 


K: 

=2 

K: 

=6 

K= 

=10 

State 

inf 

sup 

inf 

sup 

inf 

sup 

21 

41972 

43553 

39801 

45201 

38419 

46173 

22 

39416 

41067 

37562 

43122 

36413 

44445 

23 

47784 

50026 

45403 

52843 

44017 

54562 

24 

37474 

40325 

35755 

44437 

34689 

46848 

25 

47758 

50414 

44277 

53092 

42137 

54708 

26 

31581 

32173 

30906 

32897 

30510 

33400 

27 

37305 

37601 

36926 

37976 

36693 

38233 

28 

31429 

32302 

30373 

33205 

29804 

33864 

29 

45246 

47740 

42459 

50801 

40682 

52744 

30 

37009 

37350 

36532 

37721 

36220 

37953 

31 

31693 

32440 

30692 

33078 

30140 

33428 

32 

31061 

31591 

30389 

32139 

30022 

32477 

33 

31947 

32621 

31118 

33304 

30678 

33793 

34 

31179 

31620 

30647 

32106 

30335 

32429 

35 

33124 

33667 

32486 

34302 

32111 

34725 

36 

36238 

37341 

35078 

38807 

34392 

39776 

37 

41063 

42721 

39864 

45215 

39042 

46726 

38 

32761 

33142 

32445 

33718 

32266 

34087 

39 

29756 

30428 

29108 

31325 

28735 

31922 

40 

32018 

32523 

31366 

33014 

30996 

33352 

Table  5.5b  (continued) 


(In  Dollars) 


K= 

=2 

K= 

=6 

K= 

=10 

State 

inf 

sup 

inf 

sup 

inf 

sup 

41 

38526 

39808 

36966 

41270 

36014 

42204 

42 

36745 

37985 

35528 

39662 

34798 

40696 

43 

29470 

30044 

28746 

30616 

28358 

30956 

44 

39599 

42305 

37375 

46304 

36107 

48765 

45 

38349 

39587 

37316 

41386 

36685 

42446 

46 

34691 

35735 

33738 

37248 

33183 

38214 

47 

41685 

42659 

40406 

43696 

39628 

44386 

48 

41167 

42286 

39959 

43698 

39253 

44590 

49 

40886 

41279 

40377 

41700 

40060 

41964 

50 

41023 

42586 

39343 

44542 

38362 

45715 

51 

44604 

46627 

42374 

49173 

41012 

50744 

Table  5.6  Range  of  E[6{  \ y,r2]  for  the  Class  of  Uniform  Priors 


(In  Dollars) 


0 < T 

2 < I 

0 < T 

2 < 5 

0 < r2  < 10 

State 

inf 

sup 

inf 

sup 

inf 

sup 

1 

37495 

38037 

37495 

38037 

37495 

38037 

2 

45870 

51893 

45870 

52535 

45870 

52626 

3 

38390 

41348 

38390 

41460 

38390 

41475 

4 

49191 

55414 

49191 

55664 

49191 

55696 

5 

42058 

46385 

42058 

46941 

42058 

47022 

6 

51718 

54697 

51718 

54970 

51718 

55009 

7 

43480 

44639 

43444 

44639 

43439 

44639 

8 

53156 

53332 

53134 

53332 

53131 

53332 

9 

39919 

40977 

39919 

40987 

39919 

40989 

10 

39308 

44084 

39308 

44209 

39308 

44225 

11 

38932 

40203 

38932 

40277 

38932 

40287 

12 

42606 

43215 

42593 

43215 

42592 

43215 

13 

42014 

43226 

42014 

43253 

42014 

43256 

14 

39407 

42092 

39407 

42198 

39407 

42212 

15 

42026 

45126 

42026 

45366 

42026 

45399 

16 

36347 

38604 

36347 

38689 

36347 

38700 

17 

38565 

39213 

38565 

39229 

38565 

39230 

18 

32510 

36767 

32510 

36925 

32510 

36945 

19 

31720 

32606 

31720 

32610 

31720 

32610 

20 

35678 

40608 

35678 

40834 

35678 

40864 

Table  5.6  (continued) 


(In  Dollars) 


0 < r 

2<1 

0 < T 

2 < 5 

0 < r2  < 10 

State 

inf 

sup 

inf 

sup 

inf 

sup 

21 

36356 

42152 

36356 

42644 

36356 

42712 

22 

40174 

42403 

39898 

42403 

39859 

42403 

23 

48651 

50140 

48345 

50140 

48300 

50140 

24 

37535 

41159 

36810 

41159 

36701 

41159 

25 

43795 

48897 

43795 

49532 

43795 

49623 

26 

31682 

32750 

31621 

32750 

31612 

32750 

27 

37447 

37590 

37428 

37590 

37426 

37590 

28 

32263 

36279 

31927 

36279 

31881 

36279 

29 

39461 

44996 

39461 

45676 

39461 

45774 

30 

37256 

39608 

37207 

39608 

37200 

39608 

31 

32435 

34650 

32240 

34650 

32213 

34650 

32 

31673 

36497 

31400 

36497 

31364 

36497 

33 

32532 

35693 

32307 

35693 

32276 

35693 

34 

31427 

32145 

31351 

32145 

31341 

32145 

35 

32020 

33303 

32020 

33315 

32020 

33316 

36 

34309 

36053 

34309 

36183 

34309 

36201 

37 

33701 

39523 

33701 

40172 

33701 

40264 

38 

32801 

37643 

32704 

37643 

32692 

37643 

39 

29969 

35302 

29667 

35302 

29627 

35302 

40 

32345 

34012 

32252 

34012 

32240 

34012 

Table  5.6  (continued) 


(In  Dollars) 


0 < T 

2 < I 

0 < T 

LO 

VI 

<N 

0 < r2  < 10 

State 

inf 

sup 

inf 

sup 

inf 

sup 

41 

34449 

38360 

34449 

38804 

34449 

38867 

42 

37132 

40725 

36819 

40725 

36776 

40725 

43 

30015 

32529 

29816 

32529 

29789 

32529 

44 

38555 

38986 

38555 

38986 

38555 

38986 

45 

36098 

37800 

36098 

37909 

36098 

37923 

46 

35138 

41886 

34698 

41886 

34639 

41886 

47 

40651 

42286 

40651 

42370 

40651 

42381 

48 

37603 

41108 

37603 

41352 

37603 

41385 

49 

41180 

43394 

41133 

43394 

41127 

43394 

50 

42305 

50345 

41581 

50345 

41481 

50345 

51 

44031 

44842 

44031 

44962 

44031 

44980 

CHAPTER  6 


SUMMARY  AND  FUTURE  RESEARCH 
6.1  Summary 

In  this  dissertation,  we  have  considered  robust  Bayesian  inference  in  the  context  of 
finite  population  sampling.  It  turns  out  that  the  robust  Bayes  estimators  enjoy  both 
posterior  and  procedure  robustness  for  a fairly  general  class  of  priors.  The  subjective 
Bayes  estimator  enjoys  posterior  robustness,  but  fails  miserably  under  the  criterion 
of  procedure  robustness.  The  sample  mean  is  inferior  to  both  under  the  criterion  of 
posterior  robustness,  but  enjoys  certain  amount  of  procedure  robustness.  Thus,  our 
recommendation  is  to  give  the  robust  Bayes  estimators  every  serious  consideration  if 
one  is  concerned  with  both  Bayesian  and  frequentist  robustness.  The  other  important 
finding  is  that  when  the  sampling  fraction  goes  to  zero,  that  is,  we  are  essentially  back 
to  an  infinite  population,  several  asymptotic  optimality  properties  of  the  estimators 
of  Berger  and  Berliner  (1986)  hitherto  unknown  in  the  literature  are  also  established. 
Also,  modifying  slightly  the  arguments  of  Siverganesan  and  Berger  (1989),  we  have 
provided  the  range  where  the  posterior  mean  belongs  under  e-contamination  models. 
This  pertains  to  the  sensitivity  analysis  of  the  proposed  robust  Bayes  procedure. 
Similar  results  are  proved  in  the  presence  of  auxiliary  information  as  well.  As  another 
issue  in  the  prior  robustness,  we  propose  inherently  robust  Bayes  estimators  of  the 
finite  population  mean  based  on  heavy-tailed  priors  using  scale  mixtures  of  normals. 
The  .implementation  have  been  illustrated  by  adopting  a Monte  Carlo  integration 
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technique  known  as  the  Gibbs  sampler.  Also,  asymptotic  optimality  properties  of  the 
proposed  estimators  are  proved  as  the  sample  size  increases. 

We  have  also  considered  Bayesian  robustness  in  the  context  of  small  area  estima- 
tion. We  provide  robust  Bayes  estimators  of  the  small  area  means  using  ML-II  priors. 
We  compare  these  RB  estimators  with  HB  and  EB  estimators  by  anlyzing  the  real 
data.  It  turns  out  that  RB  estimates  are  inferior  to  both  HB  and  EB  estimates.  We 
provide  the  range  where  the  small  area  means  belong  under  e-contamination  class  as 
well  as  the  density  ratio  class.  The  ranges  are  fairly  small  and  robustness  seems  to 
be  achieved  using  both  classes  in  some  cases. 

6.2  Future  Research 

We  have  considered  Bayesian  robustness  in  finite  population  sampling  using  a 
class  of  priors  when  we  have  a single  characteristic  for  each  unit  in  the  population. 
A useful  research  will  be  to  provide  a suitable  extension  of  this  type  of  problem  by 
considering  the  multivariate  case  where  we  have  several  characteristics  of  interest.  We 
can  also  explore  the  situation  when  we  have  more  than  one  auxiliary  characteristic 
available.  Moreover,  we  can  also  extend  the  robustness  issue  using  heavy-tailed  priors 
in  the  presence  of  auxiliary  information. 

We  have  studied  Bayesian  robustness  in  small  area  estimation  using  a class  of 
priors  for  Fay-Harriot  model.  We  can  explore  Bayesian  robustness  in  small  area  esti- 
mation for  more  general  HB  model  given  by  Datta  and  Ghosh  (1991).  The  robustness 
study  using  heavy-tailed  priors  to  this  general  setting  is  one  important  issue. 
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